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Chapter 1 



Introduction 



This work is a continuation of the works by Andreka, Madarasz, Nemeti and their 
coauthors, e.g., [1], [2], [3], [4], [34]. Our research is directly related to Hilbert's sixth 
problem of axiomatization of physics. Moreover, it goes beyond this problem since its 
general aim is not only to axiomatize physical theories but to investigate the relationship 
between basic assumptions (axioms) and predictions (theorems). Another general aim 
of ours is to provide a foundation for physics similar to that of mathematics. 

For good reasons, the foundation of mathematics was performed strictly within 
first-order logic (FOL). A reason for this fact is that staying within FOL helps to avoid 
tacit assumptions. Another reason is that FOL has a complete inference system while 
second-order logic (and thus any higher-order logic) cannot have one, see, e.g., [20, 
§IX. 1.6]. For further reasons for staying within FOL, see, e.g.. Chap. 11 and [10], [2, 
§ Appendix: Why FOL?], [78], [82]. 

Why is it useful to apply the axiomatic method to relativity theory? For one 
thing, this method makes it possible for us to understand the role of any particular 
axiom. We can check what happens to our theory if we drop, weaken or replace an 
axiom. For instance, it has been shown by this method that the impossibility of faster 
than light motion is not independent from other assumptions of special relativity, see 
[2, §3.4], [3]. More boldly: it is superfluous as an axiom because it is provable as a 
theorem from much simpler and more convincing basic assumptions. The linearity of 
the transformations between observers (reference frames) can also be proven from some 
plausible assumptions, therefore it need not be assumed as an axiom, see Thm. 3.2.2 
and [2], [3]. Getting rid of unnecessary axioms of a physical theory is important because 
we do not know whether an axiom is true or not, we just assume so. We can only be sure 
of experimental facts but they typically correspond not to axioms but to (preferably 
existentially quantified) intended theorems. 



Not only can we get rid of superfluous assumptions by applying the axiomatic 
method, but we can discover new, interesting and physically relevant theories. That 
happened in the case of the axiom of parallels in Euclid's geometry; and this kind of 
investigation led to the discovery of hyperbolic geometry. Our FOL theory of acceler- 
ated observers (AccRel), which nicely fills the gap between special and general relativity 
theories, is also a good example of such a theory. 

Moreover, if we have an axiom system, we can ask which axioms are responsible for 
a certain prediction of our theory. This kind of reverse thinking helps to answer the 
why-type questions of relativity. For example, we can take the twin paradox theorem 
and check which axiom of special relativity was and which one was not needed to derive 
it. The weaker an axiom system is, the better answer it offers to the question: "Why is 
the twin paradox true?". The twin paradox is investigated in this manner in Chap. 7, 
while its inertial approximation (called the clock paradox) in Chap. 4. For details on 
answering why-type questions of relativity by the methodology of the present work, see 
[74]. We hope that we have given good reasons why we use the axiomatic method in 
our research into spacetime theories. For more details and further reasons, see, e.g.. 
Guts [29], Schutz [62], Suppes [67]. 

This work is structured in the following way: in Chap. 2 we introduce our FOL frame 
and our basic notation; then, in Chap. 3, we recall a FOL axiomatization of special 
relativity by our research group. Based on this axiomatization first we investigate 
the logical connection between the clock paradox theorem and the axiom system in 
Chap. 4. First we give a geometrical characterization theorem for the clock paradox, 
see Thm. 4.3.6; then we prove some surprising consequences for both Newtonian and 
relativistic kinematics. Thm. 4.5.3 answers Question 4.2.17 of Andreka-Madarasz- 
Nemeti [2]. 

In Chap. 5 we extend our geometrical approach to relativistic dynamics and inves- 
tigate the relations between our purely geometrical key axioms of dynamics and the 
conservation postulates of the standard approaches. For example, we show that the 
conservation postulates are not needed to prove the relativistic mass increase theorem 
rrio = a/1 — w^/c^ ■ m, which is the first step to capture Einstein's insight E = mc^. 

In Chap. 6 we extend the theory introduced in Chap. 3 to accelerated observers by 
introducing our aforementioned theory AccRel, which is the main subject of this thesis. 
In Chap. 7 we investigate the twin paradox within AccRel; we show that a nontrivial 
assumption is required if we want the twin paradox to be a consequence of our theory 
AccRel. In Chap. 8 we prove two formulations of the gravitational time dilation from a 
streamlined and small set of axioms (AccRel), by using Einstein's equivalence principle. 



In Chap. 9 we "derive" a FOL axiom system of general relativity from our theory 
AccRel in one natural step. The technical parts of the proofs and the development of 
the necessary tools are presented in Chap. 10. And in Chap. 11 we go into the details 
of the reasons for choosing FOL in our investigation. 

Convention 1.0.1. Throughout this work, there appear "highlighted" statements, 
such as AxCenter^ in Chap. 5, which associate the name AxCenter"*" with a formula of 
our FOL language. It is important to note that these formulas are not automatically 
elevated to the rank of axiom. Instead, they serve as potential axioms or even as 
potential statements to appear in theorems, hence they are nothing more than formulas 
distinguished in our language. 

We try to be as self-contained as possible. First occurrences of concepts used in 
this work are set in boldface to make them easier to find. We also use colored text 
and boxes to help the reader to find the axioms, notations, etc. Throughout this work, 
if-and-only-if is abbreviated as iff. We hope that the Index at the end of this thesis 
also helps find the individual definitions and notations introduced. 
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Chapter 2 



FOL frame 



In this chapter we specify the FOL frame within which we will work. 

2.1 Frame language 

Here we explain our basic concepts. This thesis mainly deals with the kinematics of 
relativity, i.e., with the motion of bodies (things which can move, e.g., test-particles, 
reference frames, center-lines). However, we briefly discuss dynamics in Chap. 5, and 
our co-authored papers [6], [7] and [39] are fully devoted to dynamics. We represent 
motion as the changing of spatial location in time. Thus we use reference frames for 
coordinatizing events (meetings of bodies). Quantities are used for marking time and 
space. The structure of quantities is assumed to be an ordered field in place of the field 
of real numbers. For simplicity, we associate reference frames with certain bodies called 
observers. The coordinatization of events by observers is formulated by means of the 
worldview relation. We visualize an observer as "sitting" at the origin of the space part 
of its reference frame, or equivalently, "living" on the time-axis of the reference frame. 
We distinguish inertial and noninertial ohseiveis. For the time being, inertiality is only 
a label on observers which will be defined later by our axioms. We also use another 
special kind of bodies called photons. We use photons only for labeling light paths, so 
here we do not consider any of their quantum dynamical properties. 

In an axiomatic approach to relativity, it is more natural to take bodies instead 
of events as basic concepts. This choice is not uncommon in the literature, see, e.g.. 
Ax [10], Benda [14]. However, a large variety of choosing basic concepts occur in the 
different axiomatizations of special relativity, see, e.g., Goldblatt [28], Mundy [46, 47], 
Pambuccian [49], Robb [53], [54] Suppes [68], Schutz [62], [64], [63]. 



Allowing ordered fields in place of the field of real numbers increases the flexibility of 
our theory and minimizes the amount of our mathematical presuppositions. For further 
motivation in this direction, see, e.g.. Ax [10]. Similar remarks apply to our flexibility- 
oriented decisions below, e.g., to treat the dimension of spacetime as a variable. 

There are many reasons for using observers (or coordinate systems, or reference 
frames) instead of a single observer-independent spacetime structure. One is that it 
helps to weed unnecessary axioms from our theories. Nevertheless, we state and empha- 
size the logical equivalence^ of observer-oriented and observer- independent approaches 
to relativity theory, see [4, §3.6], [34, §4.5]. 

Keeping the foregoing in mind, let us now set up the FOL language of our axiom 
systems. First we fix a natural number d > 2 for the dimension of spacetime. Our 
language contains the following nonlogical symbols: 

• unary relation symbols B (bodies), Ob (observers), 10b (inertial observers), 
Ph (photons) and Q (quantities); 

• binary function symbols -|- , ■ and a binary relation symbol < (field operations 
and ordering on Q); and 

• a 2 -|- d-aiy relation symbol W (worldview relation). 

B(x), Ob(x), lOb(x), Ph(x) and Q(x) are translated as "x is a body," "x is an ob- 
server," "a: is an inertial observer," "a; is a photon" and "x is a quantity," respec- 
tively. We use the worldview relation W to speak about coordinatization by trans- 
lating W(x,|/, 2;i, . . . , 2^) as "observer x coordinatizes body y at spacetime location 
(zi, . . . , 2;^)," (i.e., at space location (2:2, . . . ,Zd) and at instant Zi). 

B(x), Ob(x), lOb(x), Ph(x), Q(x), W{x,y,Zi, . . . ,Zd), x = y and x < y are the 
so-called atomic formulas of our FOL language, where x, y, zi, . . . ,Zd can be arbitrary 
variables or terms built up from variables by using the field operations. The formulas 
of our FOL language are built up from these atomic formulas by using the logical 
connectives not (->), and (A), or {y), implies (—!■), if-and-only-if (■^), and the 
quantifiers exists x ( 3x ) and for all x ( Vx ) for every variable x. To abbreviate formulas 
of FOL we often omit parentheses according to the following convention: quantifiers 
bind as long as they can, and A binds stronger than — )■. For example, we write Vx </) A 
tp —>■ 3y 5 Ar] instead of Vx((v9 A 4-') ~^ 3y(5 A r/)). 

We use the notation Q" :=Q x . . . x Q (ra-times) for the set of all n-tuples of elements 
of Q. If p G Q", we assume that p = (pi, . . . ,Pn), i.e., Pi E Q denotes the i-th com- 



^By logical equivalence, we mean definitional equivalence. 



ponent of the n-tuple p. Specially, we write W(m, h.-p) in place of W(m, b,pi, . . . ,Pd), 
and we write Vp in place of Vpi . . . Vp^, etc. 

To abbreviate formulas, we also use bounded quantifiers in the following way: 
3x (p{x) A tp and Vx ip{x) — )■ ip are abbreviated to 3x E ip tp and "ix E ip ijj, re- 
spectively. For example, to formulate that every observer observes a body somewhere, 
we write 

VmeOb36eB3pGQ'^ W{m,b,p) 

instead of 

Vm Ob(m) -^ 3b B(6) A 3^ Q(pi) A ... A Q(prf) A W{m,b,p). 

We use FOL set theory as a metatheory to speak about model theoretical concepts, 
such as models, validity, etc. 

The models of this language are of the form 

dJl = {U; Bot, Ohjji, lObsut, Ph^, Q<sx, +<m, "ot, <ot, ^<m), 

where ?7 is a nonempty set, and B^, Ob™, lObgR, Phgjt and Qgji are unary relations on 
U, etc. Formulas are interpreted in DJl in the usual way. 

Let S and F be sets of formulas, and let (f and ip be formulas of our language. Then 
E logically implies ip, in symbols S |= </?, iff (/? is true in every model of S, (i.e., if is 
a logical consequence of S). T, ^ (p denotes that there is a model of S in which if is 
not true. To simplify our notations, we use the plus sign between formulas and sets of 
formulas in the following way: S + F :=S U F, ip + tp '■={'P, ip} and T, + ip :=S U {</?}. 

Remark 2.1.1. Let us note that the fewer axioms S contains, the stronger the logical 
implication H \= ip is, and similarly the more axioms S contains the stronger the 
counterexample S ^ y? is. 

Remark 2.1.2. By Godel's completeness theorem, all the theorems of this thesis 
remain valid if we replace the relation of logical consequence (|=) by the deducibility 
relation of FOL (h). 

2.2 Frame axioms 

Here we introduce two axioms that are going to be treated as part of our logic frame. 
Our first axiom expresses very basic assumptions, such as: both photons and observers 
are bodies, inertial observers are also observers, etc. 
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AxFrame Ob U Ph C B, lOb C Ob, W C Ob x B x Q'^, B n Q = 0; + and ■ are binary 
operations, and < is a binary relation on Q^. 

Instead of using this axiom we could also use many-sorted FOL language as in [2] and 
[4], and only assume that lOb C Ob. 

To be able to add, multiply and compare measurements by observers, we provide 
an algebraic structure for the set of quantities with the help of our next axiom. 

AxEOF The quantity part (Q; +, -, <) is a Euclidean ordered field, i.e., a linearly 
ordered field in which positive elements have square roots. 

For the FOL definition of linearly ordered field, see, e.g., [15]. We use the usual field 
operations 0, 1, — , /, v and binary relation < , definable within FOL. We also use the 
vector-space structure of Q", i.e., p + q, — p , \ ■ p G Q" if p , g G Q" and A G Q; and 
o : = (0, . . . , 0) denotes the origin. 

Convention 2.2.1. We treat AxFrame and AxEOF as part of our logic frame. Hence 
without any further mentioning, they are always assumed and will be part of each 
axiom system we propose herein, except in some of the theorems of Chap. 10. 

2.3 Basic definitions and notations 

Let us collect here the basic definitions and notations that are going to be used in the 
following chapters. 

Remark 2.3.1. In our formulas we seek to use only FOL definable concepts. So 
we will always warn the reader whenever we introduce a concept which is not FOL 
definable in our language. 

The ordered field of real numbers, which is not FOL definable in our language, is 
denoted by M . The composition of binary relations R and S is defined as: 

RoS:={{a,c): 3b (a, b) e R A {b,c) eS}. 

The domain and the range of a binary relation R are denoted by 

Dom R:= {a : 3b {a,b) ^ R} and Ran R:= {b : 3a {a,b) E R} , 

respectively. R~^ denotes the inverse of R, i.e., 

R-^:={{b,a) : {a,b) E R} . 



^These statements can easily be translated to our FOL language, e.g., formula Vxy x < y 
Q(a-) A Q{y) means that "< is a binary relation on Q." 
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Remark 2.3.2. We think of a function as a special binary relation. Notation 
f : A ^ B denotes that / is a function from A to 5, i.e., Dom f = A and Ran f ^ B. 
Note that if / and g are functions, then 



(/°^)(a:)=^(/(x)) 



for all X G Dom fog. Notation f : A ^ B denotes that / is a partial function on A, 
i.e., Dom f ^ A and Ran f ^ B. 

The identity map on /J C Q'' is defined as: 

/dH:={(p,p)GQ''xQ'^ : p^H], 
and the restriction of a function / to a set H is defined as: 

f\^:={{x,y) : xE Dom fnH A f{x) = y] . 
The set of positive elements of Q is denoted by 

Qt:={xe Q :0 <x}, 
and the different kinds of interval between x, y G Q are defined as: 

(x, ?/):={tGQ : x<t<y or y<t<x}, 

[x,y]:= {t E Q : x < t < y or y < t < x} , 

[x,?/):={tGQ : X <t < y or y < t < x} , and 

(x,t/]:={tGQ : x<t<y or y<t<x}. 

We use this nonstandard but convenient notion of intervals to avoid inconveniences of 
empty intervals, such as (1,0) in the standard notion. By our definition (1,0) is not 
the empty set but the interval (0, 1). 
For any n > 1, the Euclidean length of p G Q" is defined as: 



\p\-=\/pi + ■■■+pI- 

Hence \x\ is the absolute value of x if x G Q. The (open) ball with center p G Q"' and 
radius r G Q'^ is defined as: 

5r(p):={g eQ" : \p-q\<r}, 

A set G C Q" is called open iff for all p G G there is an e G Q"*" such that B^ip) C G. 
A set iJ C Q is called connected iff (x, y) '^ H for all x,y E H. We say that a 
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function 7 : Q — )■ Q*^ is a curve if Dom 7 is connected and has at least two distinct 
elements. The standard basis vectors of Q'' are denoted by Ij , i.e., 



I 



1,: = (0,...,1,...,0) 

for all 1 < i < d. We also use notations Ij, l^, ly and 1^ instead of li, I2, I3, and I4, 
respectively. The line passing through p and q is defined as: 

line{p,q):={p + \{p - q) : AgQ}. 

Let us note that line{p,p) = {p} by this definition. It is practical to introduce a 
notation for the tx-plane: 

tx-Plane := { p e Q'^ : p3 = . . . = Pd = O} . 

2.4 Some fundamental concepts related to 
relativity 

Let us gather here some fundamental definitions and notations which are used in the 
following chapters. The set Q*^ is called the coordinate system and its elements are 
referred to as coordinate points. We use the notations 

Pa- = {P2,---,Pd) and Pr-=Pl 

for the space component and the time component of p G Q'^, respectively. The 
event evm{p) is defined as the set of bodies observed by observer m at coordinate 
point p, i.e., 

evrn{p):={h : W{m,b,p)}. 

The function that maps p to eVm{p) is also denoted by eVm- Event e is said to be 
encountered by observer k if there is a coordinate point g such that k G evk{q) = e. 
Let Evm denote the set of nonempty events coordinatized by observer m, i.e., 

Evm := { e : 3p G Q'^ ev^,{p) = e 7^ } , 

and let Ev denote the set of all observed events, i.e., 

Ev := { e : 3m G Ob e G Evm } ■ 

We say that events ei and 62 are simultaneous for observer m, in symbols ei ~me2, 
iff there are coordinate points p and g such that eVmip) = ei, eVm{q) = 62, andp,- = g,-. 
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Figure 2.1: Illustration of the basic definitions 
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Remark 2.4.1. It is easy to see that ~m is a reflexive and symmetric relation for 
every observer m; however, it is not an equivalence relation unless we assume further 
axioms. 

The coordinate-domain of observer m, in symbols Cd^, is the set of coordinate 
points where m observes something (a nonempty event), i.e., 

Cdm:={peQ'' : evmip)^^}. 

The worldview transformation between the coordinate-systems of observers k 
and m is deflned as the set of the pairs of coordinate points in which k and m coordi- 
natize the same nonempty event, i.e.. 



k 

w : 

m 



{ {p, q) G Q" X Q-^ : eVk{p) = ev^{q) ^ } 



Let us note that by this definition worldview transformations are only binary relations 
but axiom AxPHq, defined below on p. 19, turns them into functions, see Prop. 3.1.3. 

Convention 2.4.2. Whenever we write w^{p), we mean there is a unique g* G Q'^ 
such that {p,q) G w'^, and w'^{p) denotes this unique q. That is, if we talk about 
the value w^{p) of w^ at g, we postulate that it exists and is unique (by the present 
convention) . 

Since in axiomatic approaches we only assume what is explicitly stated by the 
axioms, we have to prove every other statement, even the plausible ones. So let us 
prove a proposition here about the basic properties of worldview transformations. 

Proposition 2.4.3. Let m and k be observers. Then 

(1) wl^ Idcdk, and 

(2) w^ = Idcdk iff k does not see any nonempty event twice, i.e., w^ is a function. 

(3) u;^ o u;™ D Idnomw^, and 

(4) w'^ow]^ = Idoomu,^;, iff w^m is injective. 

(5) w^ow'^^ wt, and 

(6) wlow^ = vol, iff Evk n Ev„, C Evh- 

Proof . Items (1) and (2) can be easily proved by checking the respective definitions. 
To prove Item (3), let p E Domwl,. Then, by our definitions, there is a g*G Q'' such 
that evkip) = evm{q) 7^ 0, i.e., {p,q) G w'^. Then, by our definition of worldview 
transformation, {q,p) G w;™. Consequently, {p,p) G w^ow]^, which was to be proved. 
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Let us now prove Item (4). If w'^ is not injective, there are pi,p2,q G Q'^ such that 
Pi 7^ P2 and (pi, q), {p2, q) G w^. Then (^1,^2) G tf^ o ty™. So w'^ has to be injective 

To prove the converse implication, let {p,f) G w'^ o w;™. Then there is a g*G Q'^ such 
that {p,q) G w'^ and (g, r) G w"^. So (r, g ) G tf^, and thus we get that p = rhj the 
injectivity of w^. So (p, r) G Idj^omwi' as it was required. 

Items (5) and (6) can be easily proved by checking the respective definitions. ■ 

The world-line of body h according to observer m is defined as the set of coordinate 
points where b was observed by m, i.e., 

^\m{h):={p^qf : W(m,6,p)}. 

Let us note here that both p G wlfc(6) and h G eVk{p) represent the atomic formula 
W{k,b,p), but from slightly different aspects. 

The location loCm(e) of event e according to observer m is defined as pii evm{p) = 
e and there is only one such p G Q''; otherwise loCm(e) is undefined. Event e is called 
localized by observer m if it has a unique coordinate according to m, i.e., loCm(e) is 
defined. To express that in our formulas, we use LoCm(e) as an abbreviation for the 
following formula: 

3p G Q'' evm{p) = e A Vg G Q"* evm{q) = e -^ p = q. 

Convention 2.4.4. We use the equation sign "=" in the sense of existential equality, 
i.e., a = (3 denotes that both a and (3 are defined and they are equal. We also use the 
same convention for other relations (e.g., for "<"). See [34, Conv.2.3.10, p. 31] and [2, 
Conv.2.3.10, p.61]. 

Remark 2.4.5. Let us note that locfc(e) = i? means that evk{p) = e and p = qioi all 
g for which evk{q) = e by Conv. 2.4.4. 

Remark 2.4.6. Let us note that \oCm{evk{p)) is defined iff w'^{p) is so and in this 
case they are the same, i.e., 

wtip) = \oc^{eVk{p)). 
The time of event e according to observer m is defined as: 

time^(e):=loCm(e)^ 
15 



if e is localized by m; otherwise timem(e) is undefined. The elapsed time between 
events ei and 62 measured by observer m is defined as: 

time„(ei,e2):=|timem(ei) -timem(e2)| 

if ei and 62 are localized by m; otherwise timem(ei,e2) is undefined. timem(ei,e2) is 
called the proper time measured by m between ei and 62 if m G ei ne2. The spatial 
location of event e according to observer m is defined as: 

space^(e) :=loc„(e)^ 

if e is localized by m; otherwise space„(e) is undefined. The spatial distance between 
events ei and 62 according to observer m is defined as: 

dist„(ei,e2): = |space^(ei) -space^(e2)| 

if ei and 62 are localized by m; otherwise distm(ei, 62) is undefined. 

Spacetime vector f ^ Q'^ is called spacelike iff |fv| > \rr\, lightlike iff \r„\ = 
Ir^-I, and timelike iff \f„\ < |r^|. Spacetime vectors p and q are called spacelike- 
separated, in symbols paq,iSp — q is a spacelike vector; lightlike-separated, in 
symbols p X q, iff p — g is a lightlike vector; and timelike-separated, in symbols 
p T q, iff p — g is a timelike vector. Events ei and 62 which are localized by every 
merim/ observer are called spacelike-separated (lightlike-separated; timelike-separated), 
in symbols ei a 62 (ei A 62; ei r 62), iff loCm(ei) and loCm(e2) are such for every inertial 
observer m. A curve 7 is called timelike iff it is differentiable (see p. 106), and 7'(t) is 
timelike for all t G Dorwy. 

Coordinate points p and q are called Minkowski orthogonal, in symbols p _L^ q, 
iff the following holds: Pr ■ qr = P2 ' (I2 + ■ ■ ■ + Pa ' Qd- The (signed) Minkowski length 
of p G Q'' is 



,^. , JpI-Wct? if pl>\Pa? 

Hip):-- 



IPo-P — Pt i^ other cases, 

and the Minkowski distance between p and q is fi{p,q) '■=fJ'{p — q)- We use the 
signed version of the Minkowski length because it contains two kinds of information: 
(i) the length of p, and (ii) whether it is spacelike, lightlike or timelike. A map 
/ : Q*^ — )■ Q"^ is called a Poincare transformation iff it is a transformation pre- 
serving the Minkowski distance, i.e., /i(/(p), /(?)) = /u(p, ? ) for all p , g G Q*^. Like 
transformations preserving Euclidean distance, Poincare transformations are also affine 
ones. Linear transformations preserving the Minkowski distance are called Lorentz 
transformations. 
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Chapter 3 
Special relativity 



In this chapter we axiomatize special relativity within our FOL frame. The axiom 
system SpecRel, which we introduce in this chapter, is a kind of basic axiom system that 
we will extend and transform in the forthcoming chapters. Here we also discuss some 
important properties of SpecRel, such as its completeness with respect to Minkowskian 
geometries over Euclidean ordered fields or the possible worldview transformations 
between inertial observers. 

3.1 Special relativity in four simple axioms 

Here we formulate four simple and plausible axioms that capture special relativity. We 
seek to formulate easily understandable axioms in FOL. We present each axiom at 
two levels. First we give an intuitive formulation, then a precise formalization using 
our logical notations. We give the pure FOL version of the first three axioms only. 
However, all the axioms in this thesis can also be translated easily into FOL formulas 
by inserting the respective FOL definitions into the formalizations of the axioms. 

Let us now formulate our first axiom on observers. Historically, this natural axiom 
goes back to Galileo Galilei or even to d'Oresme of around 1350, see, e.g., [3, p. 23, 
§5], but it is very probably a prehistoric assumption, see Rem. 3. LI. It simply states 
that each observer assumes that it rests at the origin of the space part of its coordinate 
system. 

AxSelfo An observer coordinatizes itself at a coordinate point iff it is in the observer's 
coordinate domain and its space component is the origin: 

Vm G Ob 'ip G Q'^ m ^ eVm{p) -v^ p E Cdm A Pa = o. 
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A purely FOL formula expressing AxSelfo is the following: 

Vm Vp Ob(m) A Q(pi) A ... A Q(pd) -^ 

(w{m,m,p) ^ 3bB{b) AW{m,b,p) Ap2 = A . . . A pd = Oj. 

Let us also introduce a strengthened version of axiom AxSelfo: 

AxSelf An inertial observer coordinatizes itself at a coordinate point iff its space 
component is the origin: 

Vm G lOb \/p e of W{m,m,p) <-> p^ = o. 

Remark 3.1.1. At first glance it is not clear why AxSelfo is so natural. As an 
explanation, let us consider the following simple example. Let us imagine that we 
are watching sunset. What do we see? We do not see and feel that we are rotating 
with the Earth but that the Sun is moving towards the horizon; and according to our 
(the Earth's) reference system, we are absolutely right. But we learned at primary 
school that "the Earth rotates and goes around the Sun." So why does not this (i.e., 
the adoption of the heliocentric system) mean that AxSelfo a-iid our impression above 
about the sunset are simply wrong? It is because the debate between geocentric and 
heliocentric systems was not about AxSelfo, but about how to choose the best reference 
frame if we want to study the motions of planets in our solar system. See [58].^ As 
reference frames, those of the Earth, the Sun, and even the Moon are equally good. 
However, if we would like to calculate the motions of the planets, the Sun's reference 
frame is the most convenient. 

Now we formulate our next axiom on the constancy of the speed of photons. For 
convenience, we choose 1 for this speed. This choice physically means using units of 
distance compatible with units of time, such as light-year, light-second, etc. 

AxPh For every inertial observer, there is a photon through two coordinate points p 
and q iff the slope of p — g is 1: 

Vm e lOb V^, q eQ'^ \Pa-qa\ = \Pr -Qrl ^ Ph H evrr,{p) n evrr,{q) ^ 0. 



^Here we consider only the basic idea of the two systems (i.e., whether the Earth or the Sun is 
stationary) and not their details (e.g., epicycles). Of course, Ptolemy's geocentric model was wrong in 
its details since even if we fix the Earth as a reference frame, the other planets will go around not the 
Earth but the Sun. It is interesting to note that Tycho Brahe worked out a correct geocentric system 
in which the Sun and the Moon move around the Earth and the other planets move around the Sun. 
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A purely FOL formula expressing AxPh is the following: 

^mWpWq lOb(m) A Q(pi) A Q(gi) A . . . A Q(pd) A Q(gd) -^ 
((Pi - Qif = {P2 - q2f + ... + {pd- QdY 

-(-)• 3ph Ph{ph) AW{m,ph,p) AW{'m,ph,q 

Axiom AxPh is a well-known assumption of Special Relativity, see, e.g., [4], [17, §2.6]. 
We may weaken AxPh by allowing inertial observers to measure different but uniform 
speeds of light. 

AxPho For every inertial observer, the speed of light is uniform and positive, and 
there can be a photon at any point and in any direction with this speed: 

Vm G lOb 3cm e Q+ Vp , q eCf \Pa- qa\= Cm- \Pt - qA 

^ V\ir]evm{p)nevm{q)^^. 

The models of our theory SpecRel (see p. 20) would change to some extent if we replaced 
AxPh by AxPho; however, they would not be essentially different. We use AxPh for 
convenience only. Sfarti [65] proves that the principle of relativity and AxPho imply 
AxPh. 

Remark 3.1.2. For convenience, we quantify over events, too. That does not mean 
abandoning our FOL language. It is just simplifying the formalization of our axioms. 
Instead of events we could speak about observers and spacetime locations. For example, 
instead of Ve G Evm we could write \/p G Cdm 0[e~^efm(p)], where none oipi. . .pd 
occur free in 0, and 0[e -^ eVmip)] is the formula obtained from by substituting 
eVmip) for e in all free occurrences. Similarly, we can replace Ve G Ev by Vtti G 
Ob Ve G Evm 0- 

By our next axiom we assume that events observed by inertial observers are the same. 

AxEv Every inertial observer coordinatizes the very same set of events: 

Vm, k G lOb Evm = Evk- 

A purely FOL formula expressing AxEv is the following: 

\/m\/k\/p lOb(m) A IOb(A:) A Q(pi) A . . . A Q(pd) -^ 3q 

Q(gi) A . . . A Q(grf) A (V6 B{b) -^ {W{m,b,p) ^ W{k,b,q)) 

Let us now prove some consequences of the axioms introduced so far. 

19 



Proposition 3.1.3. Let h be an observer and let m and k be merim/ observers. Then 

(1) Cdm = Q'^ and evm is injective if AxPho is assumed. 

(2) eVm is a bijection from Cdm to Evm] loCm is a bijection from Ev^, to Cdm] and 
w^ is a function from Q'^ to Q'^ if eVm is injective on nonempty events. 

(3) Wm is a bijection from Q"' to Q"' if AxPho and AxEv are assumed. 

Proof . To prove Item (1), let p G Q"'. Then by AxPho, there is a photon ph such that 
ph G eVmip) n eVm{p+ (1,0,..., 0, c^, 0, . . . , 0)). Hence eVmip) 7^ for all p G Q*^. So 
Cdm = Q'^- Moreover, if g G Q'^ and q ^ p, then it is possible to choose this ph such 
that ph ^ eVmiq) also holds. Thus eVm is injective. 

Item (2) is clear since, if eVm is injective on nonempty events, both \oCm'-=ev^^ and 
Wm:=eVh o\oCm are functions. 

Let us now prove Item (3). By Item (2), we already have that evk is a bijection from 
Cdk to Evk, and loCm is a bijection from Evm to Cdm- By AxEv, Evk = Evm- Thus 
w^ = eVk o loCm is a bijection from Cdfc to Cdm- But by Item (1), we also have that 
Cdk = Cdm = Q,'^- Hence Wm is a bijection from Q"' to Q"'. jjKi: 

Let us now introduce a symmetry axiom called the symmetric distance axiom, by 
which we assume that inertial observers use the same units of measurement. 

AxSymDist Inertial observers m and k agree as to the spatial distance between events 
Ci and 62 if they are simultaneous for both of them: 

Vm, k G lOb Vci, 62 G Evm. H Evk 

ei ~m 62 A Ci ~fc 62 ^ distm(ei, 62) = distfc(ei, 62). 

Let us introduce the following axiom system: 



SpecRel:= { AxSelfo, AxPh, AxEv, AxSymDist } 



Now we have a FOL theory of Special Relativity for each natural number d > 2. 

Our symmetry axiom AxSymDist has many equivalent versions, see [2, §2.8, §3.9, 
§4.2]. Let us introduce one of them here. 

AxSymTime Any two inertial observers see each others' clocks behaving in the same 

way: 



Wk, m G lOb VA G Q 



20 



To prove that AxSymTime is equivalent to AxSymDist, let us introduce a version of 
SpecRel without this axiom: 



SpecRelo:= { AxSelfo, AxPh, AxEv } 



Theorem 3.1.4. Let d > 3 and assume AxSpecRelo- Then the following three state- 
ments are equivalent: 

(1) AxSymDist, 

(2) AxSymTime and 

(3) VA;, m G lOb w'^ is a Poincare transformation. 

On the proof . By using the fact that every Poincare transformation is the composition 
of a translation, a space-isomorphism and a Lorentz boost, it is not difficult to prove 
that Item (3) implies Items (1) and (2). 

Item (2) in Thm. 3.2.2 states that Item (1) implies Item (3). 

Finally, the implication of Item (3) by Item (2) can be proved analogously to Thm. 3.2.2, 
i.e., by proving that both the field-automorphism- induced maps and the dilations in 
the decomposition of w^ and w]^ given by Item (1) in Thm. 3.2.2 are the identity 
map. 



3.2 worldview transformations in special relativity 

To prove a theorem that characterizes the worldview transformations between inertial 
observers if only AxPh and AxEv are assumed, we need one more definition. A map (p : 
Q*^ — )■ Q'' is called a field-automorphism-induced map iff there is an automorphism 
if of the field (Q, ■, +) such that (p{p) = {(p{pi), . . . , (p{pd)) for every p G Q'^. Now we 
can state the Alexandrov-Zeeman theorem generalized for fields. 

Theorem 3.2.1 (Alexandrov-Zeeman). Let be F a field and d > 3. Every bijec- 
tion from F'^ to F'^ that transforms lines of slope 1 to lines of slope 1 is a Poincare 
transformation composed with a dilation and a field-automorphism-induced map. 

For the proof of this theorem, see, e.g., [79], [80]. From this theorem we derive the 
following characterization of worldview transformations. 

Theorem 3.2.2. Let d>3. Let m and k be meriza/ observers. Then 
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(1) if AxPh and AxEv are assumed, w^ is a Poincare transformation composed with 
a dilation D and a field-automorphism-induced map ip; 

(2) if AxPh, AxEv and AxSymDist are assumed, w'^ is a Poincare transformation. 

On the proof It is not hard to see that AxPh and AxEv imply that u;^ is a bijection 
from Q*^ to Q"' that preserves lines of slope 1, see Prop. 3.1.3. Hence Item (1) is a 
consequence of the Alexandrov-Zeeman theorem generalized for fields. 

Now let us see why Item (2) is true. By using Item (1), it is easy to see that there 
is a line / such that both / and its w^ image are orthogonal to the time-axis. Thus by 
AxSymDist, w^ restricted to / is distance-preserving. Consequently, both the dilation 
D and the field-automorphism-induced map ip in Item (1) have to be the identity map. 
Hence i/;^ is a Poincare transformation. ■ 

Thm. 3.2.2 shows that SpecRel is a good axiom system for Special Relativity if we 
restrict our interest to inertial motion. It also implies that the most frequently quoted 
predictions of Special Relativity are provable from SpecRel: 

(i) "moving clocks slow down," 
(ii) "moving meter-rods shrink" and 
(iii) "moving pairs of clocks get out of synchronism." 

Even if we only assume AxPh and AxEv, we can prove qualitative versions of the pre- 
dictions above; AxSymDist is needed if we want to prove the quantitative versions, too. 
And AxSelf is only a simplifying axiom; it makes formulating the above predictions 
easier. For more detail. See, e.g., [2, §2.5], [3, §1], [4, §2]. 

The following consequence of Thm. 3.2.2 is the starting point for building Minkowski 
geometry, which is the "geometrization" of Special Relativity. It shows how time and 
space are intertwined in Special Relativity. 

Theorem 3.2.3. Let d > 3. Assume SpecRel. Then 

timem(ei, 62)^ - distm(ei, 62)^ = timefc(ei, 62)^ - distfc(ei, 62)^ 

for any inertial observers m and k and events ei and 62 coordinatized by both of them. 

Let us finally state a corollary here about the slowing down of moving clocks. 

Corollary 3.2.4. Assume SpecRel, li > 3. Let m, /c G lOb, 61,62 G Evk, and assume 
/c G 61 n 62, distm(6i, 62) 7^ 0. Then 

timem(6i,62) > timefc(6i,62). 
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In the above corollary, a "moving clock" is represented by observer k; the fact that 
it is moving relative to observer m is expressed by distm(ei,e2) 7^ and k E ei H 62; 
and that /c's time is slowing down relative to m's is expressed by timem(ei,e2) > 
timefc(ei, 62). This "clock slowing down" is only a relative effect, i.e., "clocks moving 
relative to m slow down relative to m." But this relative effect leads to a new kind of 
gravitation-oriented "absolute slowing down of time" effect, as Chap. 8 will show. 

We can summarize the results of this chapter (that standard special relativity is 
provable from SpecRel) as a kind of completeness theorem of SpecRel with respect to 
its "intended models": 

Corollary 3.2.5. Assume d > 3. Then SpecRel is complete with respect to Minkowskian 
geometries over Euclidean ordered fields. 

The formal meaning of Cor. 3.2.5 is completely analogous to that of Thm. 9.0.6 
(about general relativity) and is explained under Thm. 9.0.6. For further details, see 
[34, §4], too. 
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Chapter 4 
Clock paradox 



As one of our main aims is to trace back the surprising predictions of relativity to 
some convincing axioms, first we investigate an axiomatic basis of the clock paradox^ 
(CP), which is an inertial approximation of the famous twin paradox. A similar logical 
investigation of the twin paradox needs a more complex mathematical apparatus, see 
[35], [73] and Chap. 7. The results of this chapter are based on [70], [72] and [73]. 

CP is one of the most famous predictions of special relativity. It concerns three 
inertial observers: one of them is the stay-at-home twin and the other two simulate 
the accelerated twin in the twin paradox. This simulation is done by replacing the 
accelerated twin by a leaving inertial observer and a returning one that synchronizes 
its clock with the leaving one's when they meet. 

In this chapter we mainly concentrate on the relation of CP to the axioms and other 
consequences of special relativity, but we also formulate and characterize variants of 
CP: one where the stay-at-home twin will be the younger one (Anti-CP) and another 
where no differential aging will take place (No-CP). 

In Section 4.1 we introduce a very basic axiom system Kinerrio of kinematics in which 
no relativistic effect is assumed. Kinerrio is a subtheory of Newtonian kinematics and 
special relativity. In Section 4.3 we formulate and prove a geometrical characterization 



^Unfortunately, it is still not uncommon for people who misinterpret the word 'paradox' to try to 
find contradietions in relativity theory, that is why we think it important to note here that its original 
meaning is "a statement that is seemingly contradictory and yet is actually true," i.e., it has nothing 
to do with logical contradiction. With the nearly century long fruitless debate in view, perhaps it 
would be better to call the paradoxes of relativity theory simply effects, thus saying "clock effect" 
instead of "clock paradox," but for the time being it appears to be a hopeless effort to have this idea 
generally accepted. Anyway, we would like to emphasize that it is absolutely pointless to try to find 
a logical contradiction in relativity theory, as its consistency has been proved, see [2, p. 77], [4, Cor. 
11.12 p.644]. 
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of CP, Anti-CP and No-CP each within the models of Kinerrio, see Cor. 4.3.5 and 
Thm. 4.3.6. In Sees. 4.4 and 4.5 we prove some surprising logical consequences of 
our characterization. In Thm. 4.4.1 we show that the absoluteness of time (in the 
Newtonian sense) is not equivalent to the lack of the clock paradox (No-CP) without 
assuming a strong theoretical axiom. Similarly, in Thm. 4.5.2 we show that the slowing 
down of moving clocks is not equivalent to CP. In Thm. 4.5.3 we show that a symmetry 
axiom of special relativity is strictly stronger than CP. 

4.1 A FOL axiom system of kinematics 

We characterize the CP under some very mild assumptions about kinematics. To 
introduce this weak axiom system (Kinerrio) we formulate some further axioms. Let us 
recall that It = (1, 0, . . . , 0); and let us define the time-unit vector of k according to 
m to be 

Im :=<(!*) -<,(o). 

AxLinTime The world-lines of inertial observers are lines and time is elapsing uni- 
formly on them: 

Vm, k e lOb wL(/c) = { w^,,{d) + A ■ 1^ : A G Q } A 

Vp, g e wL(/i;) t\mek{evm{p),evrri{q)) ■ |lf„| = \p-q\. 

Let us now introduce the aforementioned axiom system of kinematics: 



Kinemo:= { AxSelf , AxLinTime, AxEv } 



Let us note that Kinemo is a very weak axiom system of kinematics. By using Item (1) 
of Prop. 3.L3 and Thm. 3.2.2, it not difficult to show that AxSelf and AxLinTime are 
consequences of SpecRel. So Kinemo is weaker than SpecReL 

4.2 Formulating the clock paradox 

To formulate CP, first we formulate the situations in which it can occur. We say 
that inertial observer m observes inertial observers a, h and c in a clock paradox 
situation at events e, e^ and Cc iff a G e^ n e, 6 G e^ fl ec, c G e fl ec, & ^ e and 
timem(ea) < timem(e) < timem(ec) or time^(ea) > timem(e) > time^(ec), see Fig. 4.1. 
This situation is denoted by CPm(ac, h) (e^, e, Cc). 

25 




Figure 4.1: Illustration of relation CPm(ac, b){ea, e, Cc) and the proof of Prop. 4.3.1 

Let a,b,c G lOb and e^, e, e^ G -E"!". Let time(ac < 6) (e^, e, e?,) be an abbreviation 
for timea(ea,e) + timec(e, ec) < timeb(ea, ec). The definitions of time(ac = 6) {ea,e,eb) 
and time(ac > 6) (ca, e, Cb) are analogous. Using this notation, we can formulate the 
clock paradox as follows: 

CP Every inertial observer m observes the clock paradox in every clock paradox sit- 
uation: 

Vm, c, a, 6 G lOb Ve, ea,ec G -Ewm CPm(ac, &)(ea, e, Cc) — ?■ time(ac < 6)(ea, e, Cc). 

We define formulas NoCP and AntiCP by replacing '<' by '=' and '>' in the formula 
CP, respectively. 



4.3 Geometrical characterization of CP 

We say that g G Q'^ is (strictly) between p E Qf" and r G Q"^ iff there is a A G Q such 
that q = \'p+ (1 — A)r and < A < 1. This situation is denoted by Bw (p, g, r ). 

Let p,q,f^ Q*^ and /i G Q such that Bw{p, fiq,f). In this case we use notations 
Conv (p, q, r) and Cone (p, g, r ) if 1 < /i and < /i < 1, respectively. 

For convenience, we introduce the following notation: 



+p: = 



P if pt> 0, 
—p if pt < 0. 
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convex 



concave 



Figure 4.2: Illustration of relations Cony(^p,qi,r), B\N{^p,q2,r) and Conc{^p,q3,: 



Proposition 4.3.1. Assume Kinerrio. Let m, a, b, and c be meriza/ observers and e, 
Ca and Cc events such that CPm(ac, 6)(ea, e, Cc). Then 



time(ac < b){ea, e, Cc) 
time(ac = 6)(ea, e, Cc) 
time(ac > &)(e(i, e, Cc) 






Proof . Let m, a, b, and c be inertial observers and e, Cq and Cc events such that 
CPm(ac, &)(ea, e, Cc). Let us abbreviate time-unit vectors ^V^ as k} throughout this 
proof. Let p = loCm(ea), q = loCm(e) and r = loCm(ec). We have that p ^ r since 
p^ < r,- or r,- < p^. Therefore, by AxLinTime, the triangle pqr is nondegenerate since 
p,r E wl„^(6) but q ^ wlm(6). Let us first show that b measures the same length of time 
between Cq and Cc as a and c together if Bw(a"'", fe"'-, c"'") holds. Let s'be the intersection of 
line{p, r) and the line parallel to line{a^, c^) through q, see Fig. 4.1. Then the triangles 
oa^b^ and pqs are similar; and the triangles ob^c^ and fsq are similar. Thus 



\p — q\ \p — s 



and 



\q — r 



\at\ \bi\ \ci\ 

hold. From which, by AxLinTime, it follows that 

\p — q\ \q — f\ 



\bt\ 



timea(ea,e 



timec(e,ecj 



it I 



\ct\ 
\p — s\ + Is* 



\r — p\ 



timec(ea,ec) 



27 



Hence time(ac = b){ea,e,ec) holds if B\N{a^,b^,c^). By AxLinTime, b measures more 
(less) time between Cq and Cc iff its time- unit vector is shorter (longer). Thus we get 
that time(ac < 6)(ea,e, Cc) holds if Conv {a^,b^,c^), and time(ac > fe)(ea,e, Cc) holds if 
Conc(a"'", b^, c^). The converse implications also hold since one of the relations Conv, Bw 
and Cone holds for a^, b^ and c^, and only one of the relations time(ac < b), time(ac = b) 
and time(ac > b) can hold for events Cq, e and Cc- This completes the proof. ■ 

A set if C Q"' is called convex iff Conv(p, q, r ) for all p,q,fE H for which there is 
a yU G Q^ such that B\n{p, fiq,f) holds. We call H fiat or concave if Conv(p, g, r) is 
replaced by Bw(g, r, p) or Conc(r, p, g), respectively. 

Remark 4.3.2. If there are no p,q,f E H for which there is a /i G Q^ such that 
Bvj{p, fiq,f) holds, then H is convex, fiat and concave at the same time. To avoid 
these undesired situations, let us call H nontrivial if there are p,q,fE H such that 
B\n{p, fiq,f) holds for a /i G Q"*". By the respective definitions, it is easy to see that 
any nontrivial convex (fiat, concave) set intersects a halfiine at most once. 

Let us define the Minkowski sphere as MSl^ := { ■'■ij^ : A; G lOb } . 

Remark 4.3.3. Convexity as used here is not far from convexity as understood in 
geometry or in the case of functions. For example, in the models of Kinemo + AxThExp+ 
or SpecRelo + AxThExp the Minkowski Sphere MS^ is convex in our sense iff the set 
of points above it (i.e., {p G Q'' : 3g G MS^ Pr > qr}) is convex in the geometrical 
sense. Axioms AxThExp+ and AxThExp are introduced on pp. 29 and 32, respectively. 

Remark 4.3.4. By Rem. 4.3.2, if MSj^ is a nontrivial convex (fiat, concave) set, it 
intersects a line at most once. 

Now we can state the following corollary of Prop. 4.3.1. 

Corollary 4.3.5. Assume Kinerrio. Then 



) 



Vm G 10b MSl^ is convex =^ CP, 
Vm G 10b MSi is fiat =^ NoCP, 

Vm G 10b MSl, is concave =^ AntiCP. 

The implications in Cor. 4.3.5 cannot be reversed because there may be inertial 
observers that are not part of any clock paradox situation. We can solve this problem 
by using the following axiom to shift inertial observers in order to create clock paradox 
situations. 
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AxShift Any inertial observer observing another inertial observer with a certain time- 
unit vector also observes still another inertial observer, with the same time-unit 
vector, at each coordinate point of its coordinate domain: 

Vm, k e lOb Vp G Cdm ^h G lOb he evm{p) A 1^ = 1^. 

Now we can reverse the above implications. 

Theorem 4.3.6. Assume Kinerrio and AxShift. Then 

CP <^=^ Vm G lOb MSl^ is convex, 

NoCP ^^ yme lOb MSl is fiat, 
AntiCP ^^ Vm G lOb MSJ^ is concave. 

Proof . By Cor. 4.3.5, we have to prove the "^^" part only. For that, let us take three 
points a', b' and c' from MS}^ for which there is a /i G Q satisfying Bw(-'-a',/i6', "'"c'). 
If there are no such points, MS^ is convex, flat and concave at the same time, see 
Rem. 4.3.2. Otherwise, by AxShift, there are inertial observers a, b and c in a clock 
paradox situation such that 1^ = a', 1^ = b' and 1^ = c'. Thus from Prop. 4.3.1 we 
get that MSJ^ has the desired property. M_ 

4.4 Consequences for Newtonian kinematics 

Let us investigate the connection between No-CP and the Newtonian assumption of 
the absoluteness of time. 

AbsTime All inertial ohserveis measure the same elapsed time between any two events: 
Vm, k G lOb Vei, 62 G Ev timem(ei, 62) = timefc(ei, 62). 

To strengthen our axiom system, we introduce two axioms that ensure the existence 
of several inertial observers. 

AxThExp^ Inertial observers can move in any direction at any finite speed: 

Vm G lOb ^p,qe Cf Pr ^ Qr -^ 3A; G lOb k G eVm{p) H eVm{q)- 

Let us also introduce a less theoretical version of this axiom. 
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AxThExp* Inertial observers can move in any direction at a speed which is arbitrarily 
close to any finite speed: 

Vm e lOb Vp, g G Q'^ Ve G Q+ Pr ^ Qr 

-> 3A; G lOb 3g' G Q'^ \q - q\<e^ke eVm{p) n evm{q). 

By the following theorem, NoCP logically implies AbsTime if AxThExp"*" (and some 
auxiliary axioms) are assumed; however, if we assume the more experimental axiom 
AxThExp* instead of AxThExp"*", AbsTime does not follow from NoCP, which is an 
astonishing fact since it means that without the theoretical assumption AxThExp^ we 
would not be able to conclude that time is absolute in the Newtonian sense even if 
there were no clock paradox in our world. 

Theorem 4.4.1. 

AbsTime [= NoCP, and (4.1) 

Kinemo + AxShift + AxThExp+ + NoCP ^ AbsTime, but (4.2) 

Kinemo + AxShift + AxThExp* + NoCP ^ AbsTime. (4.3) 

Proof . Item (4.1) is obvious. 

To prove (4.2), let us note that MS}^ is fiat by Thm. 4.3.6 since Kinemo, AxShift 
and NoCP are assumed. By axiom AxThExp^, MS^^ intersects any nonhorizontal line. 
So MS^ has to be a horizontal hyperplane containing (1,0,..., 0). Hence the time 
components of time-unit vectors are the same for every inertial observer. So AbsTime 
follows from the assumptions. 

To prove (4.3), we construct a model in which Kinemo, AxShift, AxThExp* and 
NoCP hold, but AbsTime does not. Let (Q; +, -, <) be any Euchdean ordered field. Let 
S:=Q'^ X Q"'. Let IOb:={(p, g ) G B : Pr ^ Qr A Pr - Qr ^ P2 - 92}. Let 

MS'J^ 0^:= {x eQ'^ : Xr-X2 = l A x^ > } . 

Let Vr((l,0), {p,(l)ir) hold iff r is in line{p,q). Now the worldview relation is given 
for inertial ohserver (1,0). For any other inertial ohseiver {p,q), let W/fo) ^^ ^^ affine 
transformation that takes o to p while its linear part takes It to MS,-^ Q^r\{\{p—q) : A G 
Q}, and leaves the other basis vectors fixed. From these worldview transformations, it 
is easy to define the worldview relations of other inertial observers, hence our model 
is given. It is not difficult to see that Kinemo, AxShift and AxThExp* are true in this 
model. Since MS,-^ qv is fiat and the worldview transformations are affine ones, it is 
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clear that MS^ is flat for all m G lOb. Hence NoCP is also true in this model by 
Cor. 4.3.5. It is easy to see that AbsTime implies that (lm)T = ±1 for all m,k E lOb. 
Hence AbsTime is not true in this model, as we claimed. 11 



4.5 Consequences for special relativity theory 

Now we investigate the consequences of our characterization for special relativity. To 
do so, let us flrst note that if c? > 3, our theory SpecRelo is strong enough to prove the 
most important predictions of special relativity, such as that moving clocks get out of 
synchronism, see Section 3.2. At the same time, SpecRelo is weak enough not to prove 
every prediction of special relativity. For example, it does not entail CP or the slowing 
down of relatively moving clocks. Thus it is possible to compare these predictions 
within models of SpecRelo. To investigate the logical connection between them, let us 
formulate the slowing down effect on moving clocks within our FOL framework. 

SlowTime Relatively moving meriza/ observers' clocks slow down: 

Vm,A;GlOb wU(A;) 7^ wL(m) -^ |(i;;),|>l. 

To prove a theorem about the logical connection between SlowTime and CP, we need 
the following lemma, which states that the fact that three inertial observers are in a 
clock paradox situation does not depend on the inertial observer that watches them. 

Lemma 4.5.1. Let d > 3. Assume AxPh, AxEv and AxLinTime. Let m,a,b,c G lOb 
and let Cq, e, e^ G Ev. Then 

CPraiac,b){ea,e,ec) ^ CPb{ac,b){ea,e,ec). 

Proof . By (1) of Thm. 3.2.2, AxPh and AxEv imply that w^ is a composition of 
a Poincare transformation, a dilation and a fleld-automorphism-induced map. By 
AxLinTime, the fleld-automorphism is trivial. Hence timem(e) is between timem(ea) 
and timem(ec) iff timeb(e) is between timefc(ea) and timeb(ec). This completes the proof 
since the other parts of our deflnition of CP do not depend on inertial observers m and 
b. '■. 

We cannot consistently extend SpecRelo by axiom AxThExp^ since SpecRelo implies 
the impossibility of faster than light motion of inertial observers ii d > 3, see, e.g., [3]. 
That is why we have to weaken this axiom. 
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AxThExp Inertial ohservers can move in any direction at any speed slower than 1, i.e., 
the speed of hght: 

Vm e lOb Vp, g e Q'^ \Pa - Qa] < \Pt - Qr] -^ 3kelOh k E eVmip)r]eVm{q). 

The following theorem shows that the slowing down of moving clocks (SlowTime) is 
logically stronger than CP. 

Theorem 4.5.2. Let ci > 3. Then 

SpecRelo + AxLinTime + SlowTime |= CP, but (4.4) 

SpecRelo + AxLinTime + AxShift + AxThExp + CP ^ SlowTime. (4.5) 

Proof . Item (4.4) is clear by Lem. 4.5.1. 

To prove Item (4.5), let us construct a model in which SpecRelo, AxLinTime, AxShift, 
AxThExp and CP hold, but SlowTime does not. Let (Q; +,-,<) be any Euclidean 
ordered field. Let i?:=Q"' x Q^. Let IOb:={(p, g) E B : \pa- — q^l < IPr — qrl}- It is easy 
to see that there is a nontrivial convex subset M of Q'^ such that It E M and |p^| < 1 
for some p E M. Let MSi-^^s be such a convex subset of Q*^. Let Vr((l,0), {p,q),r) 
hold iff r* is in line{p,q). Now the worldview relation is given for inertial observer 
(1,0). By Rem. 4.3.4, MS.^^^ intersects a line at most once. For any other inertial 
observer {p, q) , let wF^V be such a composition of a Lorentz transformation, a dilation 
and a translation which takes o to p while its linear part takes 1^ to the unique element 
of MS,^ Q-. n {\{p — q) : X E Q}, and leaves the other basis vectors fixed. It is easy to 
see that there is such a transformation. From these worldview transformations, it is 
easy to define the worldview relations of the other inertial observers. So the model is 
given. It is not difficult to see that SpecRelo, AxShift, AxLinTime and AxThExp are true 
in this model. Since MS,-^ ^v is convex and the worldview transformations are afiine 
ones, it is clear that MS^^ is convex for all m E lOb. Hence CP is also true in this 
model by Cor. 4.3.5. It is clear that SlowTime is not true in this model since there is 
ap E MSj^^^s such that |p^| < 1 (i.e., there is k E lOb such that 1(1^ o>)-^l ^ -'-)' ^^^ 
that completes the proof. S; 

Like the similar results of [72] and [73], Thm. 4.5.3 answers Question 4.2.17 of 
Andreka-Madarasz-Nemeti [2]. It shows that CP is logically weaker than the symmetric 
distance axiom of SpecRel. 

Theorem 4.5.3. Let d>3. Then 

SpecRelo + AxSymDist \= CP, but (4.6) 

SpecRelo + AxLinTime + AxShift + AxThExp + CP ^ AxSymDist. (4.7) 
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Proof . By (2) of Thm. 3.2.2, SpecRelo and AxSymDist imply that w^ is a Poincare 
transformation for all m,k ^ lOb. Hence 

MSi c {pe qf : pI- I/vI' = 1 a p. > } . 

Consequently, MS^ is convex. So by Cor. 4.3.5, CP follows from SpecRelo and AxSymDist. 

Since SpecRelo and AxSymDist imply SlowTime ii d > 3, Item (4.7) follows from 
Thm. 4.5.2. S 

It is interesting that, if the quantity part is the field of real numbers, AxSymDist and 
SlowTime are equivalent in the models of SpecRelo and some auxiliary axioms. However, 
that the quantity part is the field of real numbers cannot be formulated in any FOL 
language of spacetime theories. Consequently, nor can Thm. 4.5.4, so it cannot be 
formulated and proved within our FOL frame either. 

Theorem 4.5.4. Assume SpecRelo, AxLinTime, AxShift, AxThExp, and that Q is the 
field of real numbers. Then 

SlowTime <^=^ AxSymDist. 

For proof of Thm. 4.5.4, see [73, §3]. This theorem is interesting because it shows 
that assuming only that all moving clocks slow down to some degree implies the exact 
ratio of the slowing down of moving clocks (since SpecRelo + AxSymDist implies that 
the worldview transformations are Poincare ones, see Thm. 3.2.2). 

Question 4.5.5. Does Thm. 4.5.4 retain its validity if the assumption that Q is the 
field of real numbers is removed? If not, is it still possible to replace it by a FOL 
assumption, e.g., by axiom schema CONT used in [35], [36], [73] and Chaps. 7, 8 and 
10? 

We have seen that (the inertial approximation of) CP can be characterized geomet- 
rically within a weak axiom system of kinematics. We have seen some consequences of 
this characterization; in particular, CP is logically weaker than the assumption of the 
slowing down of moving clocks or the AxSymDist axiom of special relativity. A future 
task is to explore the logical connections between other assumptions and predictions 
of relativity theories. For example, in [35], [73] and Chap. 7, SpecRelp+AxSymDist is 
extended to an axiom system Ace Re I logically implying the twin paradox (the accel- 
erated version of CP), but the natural question below, raised by Thm. 4.5.3, has not 
been answered yet. 

Question 4.5.6. Is it possible to weaken AxSymDist to CP in AccRel without losing 
the twin paradox as a consequence? See [35, Que. 3. 8]. 
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Chapter 5 

Extending the axioms of special 
relativity for dynamics 



Another surprising prediction of relativity theory is the equivalence of mass and energy. 
To find an axiomatic basis to this prediction, we have to extend our approach to 
dynamics. The results of this chapter are based on [6] and [7] . 

The idea is that we use collisions for measuring relativistic mass. We could say that 
the relativistic mass of a body is a quantity that shows the magnitude of its influence 
on the state of motion of the other bodies it collides with. The bigger the relativistic 
mass of a body is, the more it changes the motion of the bodies colliding with it. To 
be able to formulate this idea, let us extend our FOL language by a new [d + 3)-ary 
relation M for relativistic mass. We use this relation to speak about the relativistic 
masses of bodies according to observers by translating M{b,p, x, k) as "the relativistic 
mass of body b at coordinate point p is x according to observer k." Since there can 
be more than one x which is M-related to b, p and k, we introduce the following 
definition: the relativistic mass of body 6 at p G Q'' according to observer k, 
in symbols mk{b,p) , is defined as x if M{b,p,x,k) holds and there is only one such 
X G Q; otherwise mk{b,p) is undefined. 

5.1 Axioms of dynamics 

In this section we introduce a FOL axiomatic theory of special relativistic dynamics. 
In our first axiom on relativistic mass, we assume that it is positive in meaningful and 
zero in meaningless situations. 
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AxMass According to any observer, the relativistic mass of a body b at any coordinate 
point p is defined and nonnegative, and it is zero iff b is not present at p: 

yk e Oh \/b e B \/p e Q'^ mkib,p) >0 A {mkib,p) = ^ b^evk{p)). 

In our co-authored papers [6] and [7], this axiom was built into the logic frame. 

To formulate our other axioms on relativistic mass, first we have to define collisions. 
To do so, we introduce the following concepts: the set of incoming bodies ink{q) and 
that of outgoing bodies outk{q) of a collision at coordinate point g* according to observer 
k are defined as bodies whose world-lines "end" and "start" at q, respectively (see 
Fig. 5.1): 

mfc(g) := {beB: q E \N\k{b) A VpGwlfc(6) Pr < qr \/ p = q } , 
outk{q) := {beB : q E \N\k{b) A Vpewlfc(6) Pr > qr V p = q}. 



Bodies bi, . . . ,bn collide originating bodies di, . . . ,dm according to observer k, in 
symbols collfc(6i, . . . , 6„ : di, . . . , dm) , iff h 7^ bj and di 7^ dj whenever i j^ j and there 
is a coordinate point q such that ink{q) = {61, . . . , bn} and outk{q) = {di, . . . , dm}. 
Inelastic collisions are just collisions in which only one body is originated. So in this 
case, we write inecollfc(6i, . . . , 6„ : (i) in place of collfc(&i, . . . , 6„ : (i) and say that bodies 
fei, . . . ,6„ collide inelastically originating body d according to observer k. For the 
illustration of these concepts, see Fig. 5.1. 

outk{q) 



Qr <► 





b c 

inecollfc(6, c : d) 



Figure 5.1: Illustration oi irik^q), outk{q) and inecollfc(fe, c : d) 

The spacetime location \oCf.{t) of body b at time instance t E Q according to 
observer k is defined as the coordinate point p for which p E wlfc(6) and Pr = t hold if 
there is such a unique p; otherwise loC;,(t) is undefined, see Fig. 5.2. 
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The center of masses cen^^''"' "(t) of bodies 61, . . . , 6„ according to A; G Ob at time 
instance t is defined by: 

n 

Y,Mb^,\o^^{t)) ■ (cen^-'^"(t) - loc^'W) = 



i=l 



bi,...,b„ 



IS 



if loc^'(t) and mfc(6j, loc^'(t)) are defined for all 1 < i < n; otherwise cen^^''"' "(t) i 
undefined. Let us note that the following is an explicit definition for cen^^'"' "(t): 

cen^-'^-m = V mikJoc'^jt)) 1^^, 

if loc^'(t) and mfc(6j, loc^'(t)) are defined for all 1 < z < n. The center-line of the 
masses of bodies bi, . . . ,bn according to observer k is defined as: 

cenfc(6i, . . . , 6„) := I cen^i'-'^"(t) : t e Q and cenl^'-'^"{t) is defined | , 

i.e., the center-line of mass is the world-line of the center of mass. 

Remark 5.1.1. Let us note that cen^,(t) = loc^,(t) for all k G Ob, 6 G B and t G Q, 
and thus cer\k{b) = v\i\k{b) for every k G Ob and 6 G B if mk{b, \oCi^{t)) is defined and 
nonzero for all t G Dom loc^, (e.g., if AxMass is assumed). 

The segment determined hj p,qE Q'^ is defined as: 

[p, q]:= { Xp+ (1 - A)g : A G Q, < A < 1 } . 
Let us call if C Q'^ a line segment if 

• it is connected, (i.e., [p, q]'^H for all p,qE H), 

• it is a subset of a line, and 

• it has at least two elements. 

Bodies whose world-lines are line segments according to every inertial observer are 
called inertial bodies, and their set is defined as: 

IB:={6 G B : VA; G lOb v\i\k{b) is a line segment}. 

Proposition 5.1.2. Let k be an inertial observer and 61, . . . , 6„ inertial bodies such 
that, for all 1 < i < n, p, g G wlfc(6j) imply mk{bi,p) = mfc(6j,g) > 0. Then the 
following hold: 
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cen'^^t) 



yk 



OCk{c,t) 



cenfc(6, c 







lyd 




X ' 1 




yb/ 


cenfc'(6, c) 


^ Vc 



6,c 



Figure 5.2: Illustration of cen^''^(t), cenfc(6, c) and of axiom AxCenter 



(1) cenfc(fei, . . . , 6„) is a line segment, a point or empty, 

(2) cenfc(6i, . . . , 6„) is nonhorizontal, i.e., r = s if r, s G cenfc(6i, . . . , 6„) and r,- = s,-, 

(3) wlfc(6i) n . . . n wlfc(6„) C cenfc(6i, . . . , fen), 

(4) cenfc(fei, . . . , 6„) is a line segment if collfc(6i, . . . ,bn : di, . . . ,dm) or co\\k{di, ...,dm- 
bi, . . . ,bn) for some (not necessarily inertial) bodies di, . . . ,dm- 

Here we omit the easy proof. 

Now we are ready to formalize that the relativistic mass of a body is a quantity 
that shows the magnitude of its influence on the state of motion of any other body it 
collides with. 

AxCenter The world-line of the inertial body originated by an inelastic collision of two 
inertial bodies is the continuation of the center-line of the masses of the colliding 
meriza/ bodies according to every meriza/ observer (see Fig. 5.2): 

VA; G lOb V6, c, rf G IB i need 1^(6, c : d) — )■ cenfc(6, c)Uwlfc((i) C i for some line i. 



The main axiom of SpecRelDyn is AxCenter which, in a certain sense, can be taken 
as a definition of relativistic mass. The other axioms of our axiom system will be 
simplifying or auxiliary ones to make life simpler. We could only get rid of them at the 
expense of sacrificing the simplicity of expressions. 

AxCenter is an axiom in Newtonian Dynamics, too, where the mass mfc(&,p) of a 
body b does not depend on observer k and coordinate point p. However, in special rel- 
ativity, AxCenter implies that the mass of a body necessarily depends on the observer. 
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The reason for this fact is that the simultaneities of different observers in special rel- 
ativity may differ from one another, and this implies that the proportions involved in 
AxCenter change, too. See [7, Prop.4.1]. 

The velocity v^(t) and speed f^(t) of body b at instant t G Q according to observer 
k are defined as: 

v^W:=((lo4).)'W and vl{t) : = \^rl{t)\ 

if loC;j(t) is defined and loc;, is differentiable at t; otherwise they are undefined. (For 
the FOL definition of f'{t), see Section 10.3.) Let us note that 

((loct).)'(t) = {{\oclnt))^ and (lo4(t).)' = ((lo4)'),(t) = 1 

if loc^(t) is defined and differentiable. 

The rest mass mo (6) of body b is defined as A G Q if (1) there is an observer 
according to which b is at rest and the relativistic mass of b is A, and (2) the relativistic 
mass of 6 is A for every observer according to which b is at rest. That is, mo(&) = A if 

3kE0h \/tEDomvl vl{t) = A Vp G wlfe(6) mfc(6,p) = A 
A VA; G Ob Vt G Domvl vl{t) = ^ Vp G wlfc(6) mfc(6,p) = A 

if there is such A; otherwise mo(6) is undefined. 

We have seen that AxCenter implies that the relativistic mass depends on both b 
and k. Our next axiom states that the relativistic mass of a body depends on its rest 
mass and velocity at the most. 

AxSpeed According to any inertial observer, the relativistic masses of two inertial 
bodies are the same if both of their rest masses and speeds are equal: 

VA; G 10b V6, c G B Vp, g G Q'^ be evk{p) A c G evk{q) 

A mo(6) = mo(c) A i;^(p^) = vl{qr) -^ mk{b,p) = mfc(c,g). 

Let Bo be the set of bodies having rest mass, i.e., Bo:= { 6 G B : mo(6) is defined }, and 
let IBo be the set of inertial bodies having rest mass, i.e., IBo :=IB fl Bq. 

By the following proposition, AxSpeed implies that the relativistic mass of an inertial 
body having rest mass does not change in time according to inertial observers. 

Proposition 5.1.3. 

AxSpeed h VA; G 10b V6 G IBo Vp,g G w\k{b) mk{b,p) = mk{b,q). 

38 



Proof. By the respective definitions, it is easy to see that vl{pr) = v\.{qr) for all 
p,qE wlfc(6). Hence by AxSpeed, mk{b,p) = 1x1^(6, g) if p, g G wlfc(6), k is an inertial 
observer, and h is an inertial body having rest mass. WL 

Prop. 5.1.3 leads us to introduce the following definition: 1x1^(6) is defined as 
^kip.p) if mfc(6, p) = mfc(6, q) for all p, g G wlfc(6); otherwise 1x1^(6) is undefined. So by 
Prop. 5.1.3 mfc(6) is defined if 6 G IBq, k G 10b and AxSpeed is assumed. Similarly, we 
use notations Vfc(6) and Vk{h) instead of v^(t) and vl.{t) when h and /c are inertial., as 
in this case v^.(ti) = v^(t2) for all ti,t2 G Domw\. 

Our last axiom on dynamics states that every observer can make experiments in 
which they make inertial bodies of arbitrary rest masses and velocities collide inelasti- 
cally: 

AxVinecoll For any inertial observer, any possible kind of inelastic collision of inertial 
bodies can be realized: 

V/c G Ob Vvi, V2 G Q'^"^ Vmi,m2 G Q |vi| < 1 A |v2| < 1 

A vi 7^ V2 A mi > A 7772 > -)• 36, c, rf G IB inecollfc(6, c : d) 

A Vfc(6) = vi A Vfc(c) = V2 A mo(6) = 7711 A mo(c) = 7772- 

We often add axioms to SpecRel which do not change the spacetime structure, but 
are useful as auxiliary axioms. For example, AxThExp''^ below states that every observer 
can make thought experiments in which they assume the existence of "slowly moving" 
observers (see, e.g., [4, p. 622 and Thm.11.10]): 

AxThExp^ For any inertial observer, in any spacetime location, in any direction, at 
any speed slower than that of light it is possible to "send out" an inertial ohserver 
whose time flows "forwards:" 

VA; G 10b Vp, g-G Q'' \{p- q)^\ < {p - q)r 

^ 3/7 G lOb /7 G evk{p) n evk{q) A wl{q)r < w'^{p)r. 

Let us extend SpecRel by AxThExp^ and the axioms of dynamics above: 



SpecRelDyn:= | AxMass, AxCenter, AxSpeed, AxVinecoll, AxThExp^ | U SpecRel 



Let us note that SpecRel Dyn is provably consistent. Moreover, it has nontrivial models, 
see Prop. 5.3.7. 
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The following theorem provides the connection between the rest mass and the rel- 
ativistic mass of an inertial body. Its conclusion is a well-known result of special 
relativity. We will see that our theorem is stronger than the corresponding result in 
the literature since it contains fewer assumptions. 

Theorem 5.1.4. Let d > 3. Assume SpecRelDyn and let k be an inertial observer 
and b be an inertial body having rest mass. Then 

mo(6) = a/1 - Vkiby ■ mkib). 

A purely geometrical proof of Thm. 5.1.4 can be found in [7]. 

Remark 5.1.5. Assuming AxPh, photons cannot have rest masses since their speed 
is 1 according to any inertial observer. However, by Thm. 5.1.4, it is natural to extend 
our rest mass concept for photons as mo(p/i):=0 for all ph G Ph. After this extension 
photons may be regarded as "pure energy" as they have zero rest masses. 

Remark 5.1.6. The conclusion of Thm. 5.1.4 fails if we omit any of the axioms 
AxMass, AxCenter, AxSpeed, AxVinecoll, AxThExp^ from SpecRelDyn. However, it re- 
mains true if we weaken AxVinecoll and AxThExp''' to the following two axioms, respec- 
tively: 

Ax3inecoll According to every observer, for every inertial body a having rest mass, 
there are inertial bodies b and c colliding inelastically such that a, b and c have 
the same rest masses, a and b have the same speeds and the speed of c is (see 
the left-hand side of Fig. 5.3): 

\/k G lOb Va e IBo 36, c,d elB mo(a) = mo(6) = mo(c) 

A Vkib) = Vkia) A Vkic) = A inecollfc(6, c : d); 

AxMedian For every two inertial bodies colliding inelastically, there is an inertial ob- 
server for which these two inertial bodies have opposite velocities and collide 
inelastically (see the right-hand side of Fig. 5.3): 

\/kelOh\/b,c,delB \neco\\k{b,c: d) 

— > 3/i G 10b v/i(6) = — v^j(c) A inecoll/j(fe, c : d). 

On Einstein's E = mc^: The conclusion mo(6) = ^/l — Vk{bY ■ ^k{b) of our Thm. 5.1.4 
above is used in Rindler's relativity textbook [52, pp. 111-114] to explain the discovery 
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mo(a) = mo{b) = mo(c) 



VA;i 



3/il 





Figure 5.3: Illustration of axioms Ax3inecoll and AxMedian 



and meaning of Einstein's famous insight E = mc?. We could literally repeat this part 
of the text of [52] to arrive ai E = mc^ in the framework of our theory SpecRelDyn 
based on the axiom AxCenter. We postpone this to a later point, because then we will 
have developed more "ammunition," hence the didactics can be more inspiring. 



5.2 Conservation of relativistic mass and linear mo- 
mentum 

In a certain sense AxCenter states that the center of mass of an isolated system consist- 
ing of two inertial bodies moves along a line regardless whether the two bodies collide 
or not. It is natural to generalize AxCenter to more than two bodies (but permitting 
only two- by-two inelastic collisions). Let AxCentern denote, temporarily, a version of 
AxCenter concerning any isolated system consisting of n bodies. Thus AxCenter is just 
AxCenter2 in this series of increasingly stronger axioms. We will see that it does not 
imply AxCenters; thus AxCenters is strictly stronger than AxCenter if certain auxiliary 
axioms are assumed, see Cor. 5.3.3 and Prop. 5.3.4. However, it can be proved that the 
rest of the axioms in this series are all equivalent to AxCentera if AxCenter is assumed, 
see Cor. 5.4.4. That motivates us to introduce SpecRelDyn'^ by replacing AxCenter in 
SpecRelDyn by the stronger AxCentera. Our theory SpecRelDyn^ is still very geometric 
and observation-oriented in spirit. Let us now introduce AxCenters, and denote it as 
AxCenter+. 
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AxCenter"'' If a is an inertial body and inertial bodies h and c collide inelastically origi- 
nating inertial body d, the center-line of the masses of a and d is the continuation 
of the center-line of the masses of a, h and c, see Fig. 5.4: 

VA; e Ob Va, h,c,de IB 

inecollA;(6, c : (i) — )■ cenfc(a, 6, c) U cenfc(a, d) C £ for some line £. 



VA; 




Figure 5.4: Illustration of AxCenter'' 



Let us replace AxCenter by AxCenter^ in SpecRelDyn: 



SpecRelDyn''":= { AxMass, AxCenter'^, AxSpeed, AxVinecoll, AxThExp^ } U SpecRel 



Let us note that SpecRelDyn"'' is also consistent. Moreover, it has nontrivial models, 
see Prop. 5.3.7. 

AxCenter determines the velocity of the body emerging from an inelastic collision, 
and we will see that AxCenter"*" also determines the relativistic mass of the body emerg- 
ing from the collision. 

Let us now formulate the conservation of relativistic mass in our FOL language. 

ConsMass If inertial bodies h and c collide inelastically originating inertial body d, 
the relativistic mass of d is the sum of the relativistic masses of h and c: 

V/c G lOb V6, c, (i G IB inecollfc(6, c : (i) — )■ mk{h) + mk{c) = n\k{d) . 



The linear momentum of inertialhodj b according to meriza/ observer k is defined 
as mfc(6)vfc(&) if Vfc(6) and ir\k{b) are defined, otherwise it is undefined. Now we can 
formulate the conservation of linear momentum in our FOL language. 
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ConsMomentum If inertial bodies h and c collide inelastically originating inertial body 
d, the linear momentum of d is the sum of the linear momentum of h and c: 

\/k e lOb V6, c,d e IB inecollfc(6, c : d) -)■ mfc(6)vfc(6)+mfc(c)vfc(c) = mfc((i)vfe((i). 

To state a theorem on the connection of AxCenter, ConsMass and ConsMomentum, we 
need the following auxiliary axiom. 

AxInMass According to any inertial observer, the relativistic mass of every inertial 
body is constant: 

VA; G lOb V6 G IB Vp,g G wU(6) mk{b,p) = mk{b,q). 

That is a consequence of AxSpeed for inertial bodies having rest mass, see Prop. 5.1.3. 
In [6] and [7], this axiom was also built in our logic frame. 

The following theorem states that axiom AxCenter+ is equivalent to the conjunc- 
tion of ConsMass and either of the two formulas AxCenter and ConsMomentum if certain 
auxiliary axioms are assumed. That means in a sense that ConsMass represents the "dif- 
ference" between AxCenter and AxCenter"*", and the same holds if AxCenter is replaced 
by ConsMomentum. 

Theorem 5.2.1. Let us assume AxMass, AxInMass, AxSelf and that lOb C IB. Then: 

AxCenter^ <^=^ ConsMass A ConsMomentum <^=^ ConsMass A AxCenter. 
The proof of Thm. 5.2.1 is in [6]. 

Corollary 5.2.2. Let us assume SpecRelDyn^. Let k be an inertial ohserver and b, c 
and d inertial bodies such that inecollfc(&, c : d) holds. Then 

mfc(d) = mfc(6) + mfc(c), but 

mo(rf) > mo(6) + mo(c), whenever Vfc( 6) 7^ Vfc(c). 

The proof itself is in [6] , here we are only concerned with the idea of the proof. 

Returning to E = mc^: Cor. 5.2.2 above can be used to arrive at Einstein's insight 
E = mc^ in the same way as it is done in Rindler's [52] and d'Inverno's [17] relativity 
textbooks. Namely, we have seen above that rest mass can be created under appropriate 
conditions. Created from what? Well, from kinetic energy (energy of motion). That 
points in the direction of Einstein's connecting mass with energy. In more detail, let 
us start with two bodies 61 and 62 of rest mass mo. Let us accelerate the two bodies 
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towards each other and let them colhde inelastically, so that they stick together forming 
the new body "61 + 62" (dehberately sloppy notation). Let us assume 61 + 62 is at rest 
relative to the observer conducting the experiment. Then the rest mass mo(6i + 62) is 
the sum of relativistic masses mfc(6i) and 1x1^(62)- Assuming that at collision the speed 



of both 61 and 62 were v, we have mo(6i + 62) = mo(6i)/-\/l — v"^ + ^o{p2) / yl — v^ , 
which is definitely greater than mo(&i) + 1x10(62) if f 7^ 0. So, rest mass was created from 
the kinetic energy supplied to our test bodies hi and 62 when they were accelerated 
towards each other. So far, we have a qualitative argument (based on our SpecRelDyn^) 
in the direction that energy (in our example kinetic) can be "transformed" to "create" 
mass. A quantitatively (and physically) more detailed analysis oi E = mc^ in terms of 
Thm. 5.1.4 is given in [52, pp. 111-114] where we refer the reader for more detail and 
for the "second part" of the argument. The "first part" was provided by Thm. 5.1.4 
and Cor. 5.2.2. 

Proposition 5.2.3. 

SpecRelDyn ^ ConsMass, and 
SpecRelDyn ^ ConsMomentum. 

The proof of Prop. 5.2.3 is in [6]. 

In the literature, the conservation of relativistic mass and that of linear momentum 
are used to derive the conclusion of Thm. 5.1.4. By Prop. 5.2.3 above, our axiom 
system SpecRelDyn implies neither ConsMass nor ConsMomentum. By Thm. 5.2.1, 
ConsMass and ConsMomentum together imply the key axiom AxCenter of SpecRelDyn. 
So Thm. 5.1.4 is stronger than the corresponding result in the literature since it requires 
fewer assumptions. 

Thm. 5.2.1 also states that the conservation axioms can be replaced by the natural, 
purely geometrical symmetry postulate AxCenter''' without loss of predictive power or 
expressive power. Since the conservation axioms ConsMass and ConsMomentum are not 
"purely geometrical" and they are less observation-oriented than AxCenter^, we think 
that it may be more convincing to use AxCenter or AxCenter"*" in an axiom system when 
we introduce the basics of relativistic dynamics. See [69, p. 22 footnote 22]. 

5.3 Four- momentum 

Neither relativistic mass nor linear momentum is Lorentz-covariant. However, they 
can be "put together" to obtain a Lorentz-covariant quantity called four-momentum, 

44 



as follows. Let k G Ob and 6 G IB. The four- momentum Pk{b) of inertial body h 
according to inertial observer k is defined as the element of Q*^ whose time component 
and space component are the relativistic mass and linear momentum of h according to 
k, respectively, see Fig. 5.5. i.e., 

Pfc(6)^ = mfe(6) and Pfc(6)a = mfc(6)vfc(6). 

It is not difficult to prove that P/fc(6) is parallel to the world- line of h and its Minkowski 
length is mo(6), see Prop. 5.3.1. Hence, it is indeed a Lorentz-covariant quantity. The 
four- velocity Vfc(6) of inertial body h according to inertial observer k is defined as 
g G Q"' if g^ > 0, /i(g) = 1 and g is parallel to vj\k{b). Let us note that v/;(&) is defined 
iff Vfc(6) is defined; and 

Mb) 



^k{b). 



VI - vkiby 



and Vk{b)a 



VI - Vkiby 



see Fig. 5.5. 





mfc(^)vfc(6) 



Figure 5.5: Illustration of four-momentum Pk{b) and four-velocity Vfc(6) 

Proposition 5.3.1. Let d>3. Assume SpecRelDyn and let k be an inertial observer 
and b an inertial body having rest mass. Then 

Pfc(6) = mo(6)vfc(6). 

Proof . By the definition of Pk{b), it is easy to see that 

KPkib)) = 



'l-Vk{b)^-mk{b). 
Hence by Thm. 5.1.4, /i(pfc(&)) = nio(6). Thus Pk{b) = mo(6)vfe(6). 
ConsFourMoment Conservation of four-momentum: 

Vfc G lOb V6, c, d G IB \neco\\k{b,c: d) -)■ pk{b) + pk{c) = pk{d) . 
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The following can be easily proved from the definition of four- moment. 

Proposition 5.3.2. ConsFourMoment ^-J- ConsMass A ConsMoment. 
Hence the following is an immediate corollary of Thm. 5.2.1. 

Corollary 5.3.3. 

AxMass + AxInMass + AxSelf + lOb C IB |= AxCenter+ o ConsFourMoment. 

Let us return to discussing the merits of using AxCenter+ in place of the more 
conventional preservation principles. In the context of Cor. 5.2.2, ConsFourMoment 
has the advantage that it is computationally direct and simple, while AxCenter+ has 
the advantage that it is more observational, more geometrical, and more basic in some 
intuitive sense. 

The following proposition shows the relation of ConsFourMoment and AxCenter. By 
Cor. 5.3.3, it also shows that AxCenter"*" is a strictly stronger axiom than AxCenter. 

Proposition 5.3.4. Assume AxMass and AxInMass. Then AxCenter is equivalent to 
the following formula 

VA; G Ob V6, c,d elB inecollfc(6, c : d) -^ 3A G Q Pfc(fe) + Pfc(c) = Xpk{d). 

Prop. 5.4.8 on p. 52 is an extension of this proposition. The following is a consequence 
of Props. 5.3.2 and 5.3.4. 

Corollary 5.3.5. 

AxMass + AxInMass + ConsMass |= AxCenter ^ ConsMoment. 
AxMass + AxInMass + ConsMoment |= ConsMass —^ AxCenter. 
AxMass + AxInMass + AxCenter |= ConsMass — t- ConsMoment. 

Remark 5.3.6. Let us, however, note that the two implications in the corollary 
above cannot be reversed since it is possible to construct a model in which there are 
an inertial observer k and inertial bodies 6, c and d such that inecollfc(6, c : d), Vfc(6) = 
— Vfc(c), Vk{d) = and mk{h) = mk{c) = mk{d); and in this model both AxCenter and 
ConsMoment hold while ConsMass does not hold. 

Let us finally state a theorem about the existence of nontrivial models of our axiom 
systems. The proof of Thm. 5.3.7 can be found in [6]. 

Theorem 5.3.7. SpecRelDyn+ U {lOb ^ 0} is consistent. 
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5.4 Some possible generalizations 

Let us formulate AxCenter in a more general setting. To do so, we introduce the set 
of bodies whose world-lines can be parametrized by differentiable curves according to 
any inertial observer: 

DB:= { 6 G B : VA; G lOb loc^ is a differentiable curve } . 

For the FOL definition of differentiability, see Section 10.3. 

AxCenterDiff If bodies b,c E DB collide inelastically originating body d G DB, the 
world-line of d is a differentiable continuation of the center-line of the masses of 
b and c according to any inertial ohseivei (see Fig. 5.6): 

yk G lOb V6, c, ci G DB Vt G Q inecollfc(6, c : d) 

A \oclit) = \ocl{t) = \octit) ^ (cent'^)'(t) = (locf)'(t). 



Vfcl 




cenfc(6,c) 



Figure 5.6: Illustration of axiom AxCenterDiff 



By the following proposition, AxCenterDiff is an extension of AxCenter. 

Proposition 5.4.1. AxMass + AxInMass |= AxCenterDiff — )■ AxCenter. 

On the proof . The proof is based on the following two facts: (1) IB C DB, and (2) 
AxMass and AxInMass imply that cenfc(6, c) is a line segment if /c G lOb, 6, c G IB such 
that inecollfc(6, c : d), see Prop. 5.1.2. 



It is also natural to generalize AxCenter to more than two bodies. Now we formulate 
some of the possible generalizations. 
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AxCentern After some two-by-two inelastic collisions of inertial bodies &i, . . . , 6„, their 
center-line of masses and the center-line of the masses of the last originated 
inertial body and the noncolliding bodies are in one line: 



VfceOb V6i,...,6„GlB 

l\ I Vdi, . . . , dj+i e\B di = hi A l\ inecollfc(rfi, 6i+i : (ij+i) 

— )■ cenfc(6i, . . . , 6„) U cenfc((ij_|_i, &J+25 • • • , &n) C £ for some line ^ 

AxCenter* During all successive two-by-two inelastic collisions of meriza/ bodies bi, . . . ,bn, 
the center-line of mass after each collision is the continuation of the center-line 
of mass before the collision: 

VA;GObV6i,...,6„GlB 

n.-l j 

/\ I Vdi, . . . , dj+i elB di = bi A f\ inecollfc((ii, k+i : di+i) -> 
i=i i=i 

cenfc(6i, . . . , 6„) U |J cenfc(4+i, 6^+2, • • • , 6„) C £ for some line ■ 

s=l<j 

AxCenter is just AxCenter2 or AxCenterj in these two series of axioms. Let us now 
decompose AxCentern and AxCenter* into the following fragments: 

AxCentern j After j two-by-two inelastic collisions of inertial bodies 61,..., 6„, their 
center-line of masses and the center-line of the masses of the last originated 
inertial body and the noncolliding bodies are in one line: 

j 
VA; G Ob V61, . . . , bn, di, . . . , dj+i G IB di = bi A f\ \neco\\k{di, k+i : di+i) 

i=l 

— )• cenfc(6i, . . . , 6„) U cenk{dj+i, 6j+2, ■ ■ ■ ,bn) C £ for some line i. 

The definition of AxCenter* j is analogous. Then AxCentern and AxCenter* are equivalent 
to 

n— 1 71—1 

A AxCentern J and A AxCenter* j, 

i=i i=i 

respectively. Let us now see some of the logical connections between the above two 

series of axioms. 

Proposition 5.4.2. Let x, y, n and m be natural numbers such that 1 < x < y < 
n < m. Then 
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(1) AxCenter |= AxCentern,y ^ AxCentern,x, and 
AxCenter |= AxCenter* y ^ AxCenter*^. 

(2) a.) AxCenter* y |= AxCenter* j^, but 
b.) AxCentern,y ^ AxCenterp^x- 

(3) a.) AxCenter*^ |= AxCentern,x, but 
b.) AxCentern,x ^ AxCenter* j^. 

(4) a.) AxCentern^x ^ AxCenterm,x, and 
b.) AxCenter* X ^ AxCenter;!;^ ^. 

Proof . Item (1) can be proved by induction on y — x. 
Item (2a) is true since 

y cenkids+i,bs+2,---,bn)'^ [J cenfc(4+i, &s+2, • • • , &«)• 

s=l<x s=l<y 

To prove (2b), let OJt be a model such that IB:={bi, . . . ,bn,di, . . . dx+i}, IOb:={A;}, 
wlk{k) = 0, 6i = di and AxCenternx is not valid. It is not difficult to see that there is 
such a model 9Jt. Since there are no n + y — 1 pieces of distinct bodies in the required 
collision situation, AxCenterp y is valid (its condition is empty). 

Item (3a) is true since 

cenfc(4.+i,6^+2,- • • ,fen) ^ [J cef\k{ds+i,bs+2,---,bn). 

s=l<x 

Item (3b) is true since its opposite together with (2a) and (3a) states that AxCenterp^x+i \= 
AxCenter* ^^;^ |= AxCenter* ^^ |= AxCentern^x which contradicts (2b). 

Item (4) is true since noncoUiding bodies cannot affect the center-line of mass. ■ 

Let us now introduce an axiom about general situations of two-by-two inelastic 
collisions. Let Sqn be the set of at most n-long sequences of natural numbers between 
1 and n. Let us denote the concatenation of sequences a,b ^ Sqn by ab . We say that 
Gin is a generalized index set of basis n if 

• {!,...,«} C Gin C Sqn] 

• fli < . . . < flfc if ai" . . . "flfc G Gin and ai, . . . , Cfc G {1, . . . , n}; and 

• For all a G Gin \ {1, . . . ,n}, there is a unique decomposition a = Kc such that 
b,c^ Gin. 
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AxCenter*Gln During a collision situation of type Gin of inertial bodies &i, . . . , 6„, the 
center-line of mass after each collision is the continuation of the center-line of 
mass before the collision, see Fig.5.7: 

VA; e Ob (V6i elB ie Gin) /\ inecollfc(&i, bj : b^j) 

— )■ M cenfc(6jj, bj2, ■ ■ ■ , bj^) C i for some line i. 

jl,...,jm&GI„ jl'...~jm. = l'--'n 



hv 



1'2~3"4"5 




Figure 5.7: Illustration for AcCenter*Gln if G/„ = {1, 2, 3, 4, 5, r2, rs, 3"4"5, 1"2"3"4^5} 

Let us note that every two-by-two inelastic collision can be coded by a generalized 
index set, but not every generalized index set can code a collision situation, e.g., gen- 
eralized index set {1, 2, 3, r2, r3} does not correspond to a collision situation. In such 
a case AxCenter*Gln states nothing since its conditions cannot be satisfied. 

Theorem 5.4.3. Let n be a natural number and let Gin be a generalized index set. 
Then 

AxCenter -|- AxCenter^ |= AxCenter*Gln. 

On the proof The theorem can be proved by induction on the complexity of the gen- 
eralized index set (?/„. ■ 
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Corollary 5.4.4. Let n be a natural number. Then 

AxCenter + AxCenter'^ |= AxCenterp, and 
AxCenter + AxCenter'^ |= AxCenter*. 

Let us finally generalize AxCenter to noninelastic collisions, too. 

AxCentern:m If inertial bodies 61, . . . , 6„ collide originating inertial bodies t/i, . . . , dm-, 
the center-line of the masses of di, . . . , (im is the continuation of the center-line 
of the masses of fei, . . . , fe„: 

V/c G Ob V61, . . . , 6„, di, . . . , (im G IB collfc(6i, . . . ,dn : di, . . . , dm) 

-7- cen,fc(6i, . . . , 6„) U cen^^di, . . . , dm) C i for some line i. 

AxCenter is just AxCenter2:i in this series of axioms since inecollfc(6, c : d) is the same 
formula as collfc(6, c : d). 

Remark 5.4.5. {AxCentern-m : ra, ?7i G w} is an independent axiom system. 

Let us now extend the formula expressing the conservation of four-momentum of 
two inelastically colliding merfza/ bodies (ConsFourMomentum) to general collision sit- 
uations. 

ConsFourMomentumn:m If mertza/ bodies 61, . . . , fe„ collide and originate meriza/ bodies 
di, . . . , dm, the sum of four-momentums oi di, . . . ,dm and 61, . . . , 6„, is the same: 

Wk G lOb Wbi, . . . , 6„, di, . . . , dm G IB 

n m 

co\\k{bi,...,bn:di,...,dm) -> ^Pkik) = ^Pk{dj). 

Remark 5.4.6. It is suggested by Rindler to assume all the formulas above as axioms 
of relativistic dynamics, see [52, p. 109]. However, the formulas ConsFourMomentn.m 
are not natural and observational enough assumptions to regard them as axioms. By 
Cor. 5.4.9, we can offer a list of more natural formulas to be assumed, which is equivalent 
to the list above. 

Remark 5.4.7. The following theory is consistent and independent: 

SpecRel U { lOb ^ 0, AxThExp } U { AxInMass, AxMass } 

U { AxVcolln:m, ConsFourMomentn:m : n,m e u} , 

where AxVcolln:m is a generalization of AxVinecoll which ensures the realization of every 
possible collision of type "n : m." 
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The following proposition extends Prop. 5.3.4. It also shows the logical connection 
between AxCentern-m and ConsFourMomentn:m- 

Proposition 5.4.8. Assume AxMass and AxInMass. Let k be an merim/ observer and 
let bi, . . . ,bn be inertial bodies. Then the following is equivalent to AxCentern:m: 

VA; e lOb ^bi,. . .,bn,di,. . . ,dm& IB 

n m 

co\\k{bi,...,bn: dl,...,dr,^) -> 3A e Q ^Pkih) = X^pkidj). 

i=i j=i 

Proof . By Lem. 5.4.10, cenfc(6i, . . . ,bn) and cer\k(di, . . . , dn) are parallel iff J27=i Pfc(^i) 
and X]j=i Pk{dj) are parallel. We have that cenfc(6i, . . . , 6„) fl cenk{di, . . . , dn) 7^ if 
collfc(6i, . . . ,bn '■ di, . . . , dm)- Thus AxCentern:m holds iff 

n m 

^Pk{bi) = X^Pkidj) 

for some A G Q. Hi- 

Let us now introduce a list of axioms which are formulated in the spirit of AxCenter 
and whose elements are equivalent to the corresponding ConsFourMomentumn:m formu- 
las. 

AxCenter^^ If a is an inertial body and inertial bodies bi, . . . ,bn collide originating 
inertial bodies di, . . . , dm, the center-line of the masses of a, 61, ... , 6„ is the con- 
tinuation of the center-line of the masses of a, di, . . . , dm, i-e., there is a line that 
contains both of them: 

\fk e lOb \/bi,...,bn,di,...,dme IB collfc(6i, . . . ,bn : di, . . . ,dm) 

— )■ cenfc(a, 61, . . . , 6„) U cenfc(a, di, . . . , dm) C £ for some line i. 

The following corollary can be proved from Prop. 5.4.8 in a strictly analogous way 
to the proof of Cor. 5.3.3. 

Corollary 5.4.9. 

AxInMass + AxMass + AxSelf + lOb C IB |= Center^^ ^-> ConsFourMomentumn:m 

Lemma 5.4.10. Assume AxMass and AxInMass. Then for all k E lOb and 61, . . . , 6.„ G 
IB, 

n 

certkibi, ...,bn) is parallel to ^ Vk{h) 
if cenfc(6i,...,fe„) 7^ 0. 
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i=l 



Proof . We prove the statement by induction on n. It is clear if n = 1 since Pk{b) is 
parallel to cer\k{b) = wlfc(6) if AxMass and AxInMass are assumed, see Prop. 5.1.2. Now 
we have to prove that if it is true for n, then it is also true for n+1. Let 61, . . . , 6„, fc^+i G 
IB. Since cenfc(6i, . . . ,6n+i) 7^ and translations preserve parallelism, we can assume 
that o G cenfc(6i, . . . ,6„+i). By the induction hypothesis, we can expand our model 
with inertial body c such that wlfc(c) = cenfc(62, • • • , &n+i) and Pfc(c) = X]r=2 Pkih)- 
Since cen^^bi, . . . ,bn) = cenfc(6i,c), we have to prove that cenfe(6i,c) is parallel to 
Pfc(^i) + Pfc(c). Without losing generality, we can assume that wlfe(6i) and wlfe(c) are 
lines. Then 

- /, X mfc(&i) + mfc(c) , ^ , . mfc(fei) + mfc(c) 

Pfc(oi) 77— and pfc(c) 



mfc(6i) mfc(c) 

have the same time component mk{bi) + mfc(c); and they are in vj\k{bi) and wlfc(c), 
respectively. So cen^^''^(mfe(6i) + m/,.(c)) = Pa:(&i) + Pfc(c) since o G cenfc(6i,c). Hence 
cenfc(6i, c) and PA:(fei) + Pfc(c) are parallel; and that is what was to be proved. ■ 

5.5 Collision duality 

To every formula (p in the language of SpecRelDyn, we can define its collision dual 
(/?® in which the subformulas of the form collfc(6i, . . . , 6n : di, . . . , dm) are replaced by 
co\\k{di, . . . ,dm '■ bi, . . . ,bn), i.e., the incoming and the outgoing bodies in the colhsions 
are interchanged. By this definition of collision dual, it is clear that (y?®)® and ip are 
the same. Let S be a set of formulas. The set of collision duals of the formulas of S is 
denoted by S® . 

Formula (f is called self-dual if if and (y9® are the same. Every formula which is 
not about collisions is self-dual. So most of the axioms of SpecRelDyn are self-dual. 
There are also self-dual axioms about collisions, e.g., AxCentern:n is such for all natural 
number n. 

Proposition 5.5.1. Let E be a set of formulas and let (p be formula in the language 
of SpecRelDyn. Then 

S 1= (^ iff S® 1= (^®. 

Proof . By Godel's completeness theorem, H \= if holds iff S h </), i.e., there is a formal 
proof of (f from S. It is clear that S h </) iff S® h (/)® since we get a formal proof of 
y9® from S by replacing every formula by its collision in the formal proof of (f from S. 
From this, we get S® |= ^9® by Godel's completeness theorem. ^ 
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By applying Prop. 5.5.1 to Thin. 5.1.4, we get the following as its conclusion is 
self- dual. 

Corollary 5.5.2. Let d > 3 and assume SpecRelDyn®. Let k be an meritm/ observer 
and b be an inertial body having rest mass. Then 

mo(6) = a/1 -Vk{hY ■ mk{h). 

Remark 5.5.3. It is natural to interpret inecoll as nuclear fusion. By this interpre- 
tation inecoll® becomes nuclear fission. In this case axioms AxVCenter® and AxVCenter 
(and even Ax3Center® and Ax3Center) are too strong since they require the existence of 
several fusions and fissions which might not exist in nature. However, it is not a prob- 
lem since all the theorems which use these axioms can be reformulated without them by 
building them into the statements. So instead of assuming that certain fusion/fission 
situations exist and proving a statement from that, we can omit this assumption and 
prove the statement only for the bodies which appear in the corresponding fusion/fission 
situations. 

5.6 Concluding remarks on dynamics 

We have introduced a purely geometrical axiom system of special relativistic dynamics 
which is strong enough to prove the formula connecting relativistic and rest masses of 
bodies. We have also studied the connection of our key axioms AxCenter and AxCenter"*" 
and the usual axioms about the conservation of mass, momentum and four-momentum. 
We saw that the conservation postulates are not needed to prove the relativistic mass 



increase theorem rriQ = a/1 — f^/c^ ■ ?n, see Prop. 5.2.3 at (p. 44) and Thm. 5.1.4 at 
(p. 40). See also [69, p. 22 footnote 22]. Connections with Einstein's insight E = mc^ 
have also been discussed. The contents of the present chapter represent only the first 
steps towards a logical conceptual analysis of relativistic dynamics. A glimpse into 
Chap. 6 (pp. 108-130) "Relativistic particle mechanics" of the textbook by Rindler [52] 
suggests the topics to be covered by future work in this line. In another direction, 
looking at the logical issues in [2] and [4] suggests questions and investigations to be 
carried out in the future about the logical analysis of relativistic dynamics. 
Let us mention here two tasks that should be done in the future: 

Question 5.6.1. Analyzing the possibility/impossibility of faster than light motion 
of colliding bodies within axiomatic special relativistic dynamics similarly to what was 
done for observers within special relativistic kinematic, see, e.g., [4, Thm. 11. 7], [38, 
Thm.3, Thm. 5]. 
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Question 5.6.2. Extending tlie axiomatization of relativistic dynamics for acceler- 
ated observers. 

A work related to tliis cliapter witli somewliat different aims is [56]. 
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Chapter 6 

Extending the axioms of special 
relativity for accelerated observers 



In this chapter we extend our axiomatization of special relativity to non-inertial ob- 
servers, too. Non-mertza/ observers are also going to be called accelerated observers. 
We have two reasons for extending our aproach to accelerated observers: to take a step 
towards a FOL axiomatization of general relativity (see Chap. 9) and to provide an ax- 
iomatic basis of the twin paradox and other surprising predictions of special relativity 
extended to non-inertial observers. The results of this chapter are based on [5], [35] 
and [73]. A further aim is to prove predictions of general relativity from our theory of 
accelerated observers, by using Einstein's equivalence principle, cf. our interpretation 
of answering why-questions at p. 4 or [74]. 

6.1 The key axiom of accelerated observers 

It is clear that SpecRel is too weak to answer any nontrivial question about acceleration 
since AxSelfo is its only axiom that mentions non- merim/ observers. To extend SpecRel, 
we now formulate the key axiom about accelerated observers. It will state that the 
worldviews of accelerated and inertial observers are locally the same. 

To connect the worldviews of the accelerated and the inertial observers, we formu- 
late the statement that, at each moment of its world-line, each accelerated observer 
coordinatizes the nearby world for a short while as an inertial observer does. To for- 
malize that, first we introduce the relation of being a co- moving observer. Observer m 
is a co-moving observer of observer /c at g* G Q'', in symbols m >~q- /c , iff g G Dom w^ 
and the following holds: 

Ve G Q+ 35 G Q+ Vp G Bs{q) nDomw^ km,(p) - p\ < £ ■ \p-q\- 
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Behind the definition of co-moving observers is the following intuitive image: as we 
zoom in the neighborhood of the coordinate point, the worldviews of the two observers 
are getting more and more similar. 

Remark 6.1.1. Let us note that g'G Cdm and eVmiq) = evk{q) if m >~q- k. It can be 
proved by choosing p as qE Bs{q) fl Dom w^. 

Remark 6.1.2. By Conv. 2.4.2, there is a 5 G Q"*" such that w^ is a function on 
Bs{q) n Dom w^ if m y^ k. So w^ is a function on a small enough neighborhood of q 
a m >~q k and Dom w^ is open. 

The relation >~^ is transitive but it is neither reflexive nor symmetric. It is not 
reflexive because if k is an observer such that ewfc((0, -, . . . , 0)) = eVk{o) for all 
new, then w^ is not a function on any neighborhood of o. Thus k ]/-g k see Rem. 6.1.2. 
Example 6.2.4 shows that >~^ is not symmetric. The relation y^ becomes an equivalence 
relation, e.g., if w^ is a function and deflned in a small enough neighborhood of g'for 
each k,m, E Ob. This will be the case in our last axiom system GenRel in Chap. 9. 

Now we can formulate the key axiom of accelerated observers, called the co-moving 
axiom. This axiom is about the connection between the worldviews of inertial and 
accelerated observers: 

AxCmv For every observer and event encountered by it, there is a co- moving inertial 
observer: 

V/c G Ob Vg G Q'^ k e evk{q) -^ 3m G lOb m y^ k. 

Remark 6.1.3. Let us note that from AxCmv and AxEv follows that inertial observers 
coordinatize every event encountered by an observer, i.e., e G Evm for all event e and 
inertial observer m whenever k E e E Evk for some observer k. That is true since 
inertial observers coordinatize the same events by AxEv; there is an ?72 G lOb such that 
my^k and evk{q) = e by AxCmv; and g G Cdm if m y^ k, see Rem. 6.1.1. 

Before we go on building our theory of accelerated observers, let us prove a propo- 
sition reformulating the co-moving relation. For the notion of differentiability in our 
framework, see Section 10.3. Let us note that in our framework a function differentiable 
at g*may have several derivatives at q. 

Proposition 6.1.4. Let m and k be observers and g* be a coordinate point. The 
following two statements are equivalent: 

(a) m >-g> k, i.e., m is a co-moving observer of k at q. 
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(b) w^{q) = q, w^ is differentiable at g, and one of its derivatives at g is the identity 
map. 

Proof . To prove (a) =^ (6), let m >~g- k. Then q G Domw^ by definition. By 
Rem. 6.1.2, we have that w^ is a function on Bs^Xq) H Domw^ for some 5q G Q''". 
Thus w^{q) is defined, and it is4s qhy Rem. 6.1.1. By the definition of m >~^ k, for 
all e G Q"*", there is a 5 G Q"*" such that the inequality Iw^i^p) — p | < elp — q\ holds 
for all p G Bs{q) fl Domw^. Since w^{q) = q, this inequality can be rewritten as 
I'^mip) ~ '^mi.'i) ~ Id{j) — q)\ < e\p — q\. So w^ is differentiable at g and one of its 
derivatives at q is the identity map. 

To prove the converse implication, let w^ be differentiable at q such that one of its 
derivatives at g is the identity map, and w^{q) = q. Then for all e G Q'^, there is a 5 G 
Q"*" such that \w'^{p) —w^{q) — Id{p —q)\ < e\p—q\ holds for allp G Bs{q)nDomw'^. 
And, since w^{q) = g, this last inequality is the same as \w^{p) —p\ < e\p —q\. Thus 
m y^ k. H. 

The world-line of an observer represents the set of coordinate points where the 
observer is during its life but it does not tell how "old" the observer is at a certain 
event. So let us define the life-curve Ic^ of observer k according to observer m as the 
world-line of k according to m parametrized by the time measured by k, formally: 

ICm ■■= {{t,p) eQxQ'^ : 3g G Q"' ke evk{q) = ev^{p) A g, = t } . 

For the most important properties of life-curves, see Prop. 6.1.6. 

Let the natural embedding l : Q — )■ Q'^ be defined as t(a:):=(x, 0, . . . , 0) for all 
X G Q. 

Lemma 6.1.5. Assume AxSelfg. Let k and m be observers. Then \c^:=l o w^. 

Proof . By our definitions, 

{ {tj) G Q X Q'^ : 3g G Q"^ evk{q) = eVm{p) 7^ A g^ = t A g; = o } . 



LOW^^ 



k 
m 

k i 



By AxSelfo and the fact that evk[q) ^ 0, we have g^ = o iff A; G ewfc(g). So Ic: 

LOW, 



Let us introduce here a very natural axiom about observers, which is going to be 
used in the following proposition. 

AxEvTr Every observer encounters the events in which it has been observed: 

Vm G Ob Ve G Ev m E e — ;■ e G Evm- 
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Proposition 6.1.6. Let m, k and h be observers. Then 
(1) Icl is a function iff 



~m 



(i) event e lias a unique coordinate in Cdm whenever k E e ^ Ev^ H Ev^, and 

(ii) evk{q) = evk{q') if q,q' G Cdk such that evk{q),evk{q') G Evm, k G 
evk{q) n et;fc(g') and qr = q'r- 

(2) Ic^ is a function if m is inertial, and AxPho and AxSelfo are assumed. 

(3) Ic^ ^ Ic^ o w^ always holds, and 

Ic^ = Ic^i, o w^ holds if we assume Evm C Evk- 



(4) {q-r : k G evk{q)} = Dora \c\ 3 Doni Ic^ always holds, and 



DoTTi Ic^ = Dom Icr, holds if we assume AxCmv and m G lOb. 



(5) Ran Ic^ C wlm(fc) always holds, and 
Ran Ic^ = wlm(/c) if we assume AxEvTr. 

Proof . Item (1) is a straightforward consequence of the definition of Ic^. To see that, 
let R:={{t,q) G Q x Cdk '■ k G evk{q) A qr = t}. Then Ic^ = Row^ = RoeVko\oCm- 
Since ef^ is a function and loCm is an inverse of a function, it is easy to see that 
Ic^ is a function iff \oCm is a function on Ran {R o eVk) and i? o ef ^ is a function to 
Dom \oCm = Evm- It is clear that loCm is a function on Ran {R o evk) iff (i) holds; and 
it is also clear that i? is a function to Dom \oCm = Evm iff (ii) holds. Hence Ic^ is a 
function iff both (i) and (ii) hold. 

To prove Item (2), we should check (i) and (ii) of Item (1). By Item (1) of Prop. 3.1.3, 
(i) is true. By AxSelfo if A; G evk{q) fl evk{q'), then q^ = o = q'^. Thus if g^ = q'^ also 
holds, then q = q'. Hence (ii) is also true. 

To prove Item (3), let {t,p) G Ic^ °uj'^. That means 3c G Cdk such that (t, c ) G Ic^ 
and {c,p) G w^, which is equivalent to 3g G Cdh such that h G evh{q) = evk{c), 
q^- = t and eVk{c) = eVm{p)- Thus (t,p) G \c^. To prove the converse inclusion, 
let (t,p) G Ic^. That means that there is a coordinate point q G Cd^ such that 
h G evh{q) = eVmip) and g^ = t. By the assumption Evm ^ Evk, we have that 
3c G Cdk such that efA;(c) = evm{p)- Thus (t,p) G Ic^ o w'^. That proves Item (3). 

To prove Item (4), let us recall that t G Dom\c^ iff there are p G Cdm and 
g* G Cfifc such that /c G eVm{p) = GVk{q) and g^- = t. From that, it easily follows that 
t G Dom\Cf^ iff there is a coordinate point g* G Cdk such that qr = t and /c G eVk{q)- 
Thus {g,- : A; G effc(g)} = Dom\Ck ^ Dom\Cm is clear; and if we assume AxCmv and 

59 



m G lOb, then Dom Ic^. C Dom Ic^ is also clear since inertial observers coordinatize 
every event encountered by observers, see Rem. 6.1.3. 

To prove Item (5), let us recall that p G Ran \c^ iff p G Cdm and there are t G Q and 
q G Cdk such that k G eVm{p) = evk{q) and Qr = t. Thus Ran\c^ C \N\m{k):={p G 
Cdm : k G evmip)} is clear. If p G wlm(A;), then k G eVmip)- Therefore, by AxEvTr, 
we have that eVmip) G Evk- Thus there is a coordinate point q G Crffc such that 
e-Vmip) = evk{q). Hence Ran\c^ = \N\m{k). 



We call a timelike curve a well-parametrized if /i(a'(t)) = 1 for all t G Dom a. 
For the FOL definition of a', see Section 10.3. 

Assume = M. Then curve / is well-parametrized iff / is parametrized according 
to the Minkowski length, i.e., for all x, y G Dom /, the Minkowski length of / restricted 
to [x, y] is y — X. (By the Minkowski length of a curve we mean length according to 
the Minkowski metric, e.g., in the sense used by Wald [81, p. 43, (3.3.7)]). If the proper 
time is defined as the Minkowski length of a timelike curve, see, e.g., Wald [81, p. 44, 
(3.3.8)], Taylor-Wheeler [75, 1-1-2] or d'Inverno [17, p. 112, (8.14)], a curve defined on 
a subset of M is well-parametrized iff it is parametrized according to proper time (see, 
e.g., [17, p. 112, (8.16)]). Hence for well-parametrized curves, our definition of proper 
time (see p. 16) coincides with the definition of the literature. 

Example 6.1.7. Let us list some examples of well-parametrized curves here: 

(1) 7(t) = 1/2 ■ (tV3 - 1/t, tV3 + 1/t, 0, . . . , 0) for all t G Q+. 

(2) 7(t) = (v^/3 + Vt, 7^3/3 - v^t, 0, . . . , 0) for all t G Q+. 

(3) 7(t) = (a ■ sh{t/a), a ■ ch{t/a), 0, . . . , 0) for all aeR-^ and t G M. 



(4) 7(t) = {Va?~+T ■ t, cos(a ■ t), sin(a ■ t), 0, . . . , 0) for all a G IR+ and t G M. 

Let us note that examples (1) and (2) do not have well-parametrized extensions. De- 
finability in Prop. 6.1.8 is meant in the same way as in Sec. 7.2. 

Proposition 6.1.8. The vertical timelike unit-hyperbola 

Hyp:={peQ'^ : pj - p^^ = l,p^ = . . . = pa = O} 

can be well-parametrized by a definable curve iff an exponential function is definable 
over Q, i.e., there is a definable well-parametrized curve 7 : Q — )■ Q'^ such that Rawy = 
Hyp iff there is a definable function e : Q — )■ Q such that e'{t) = e{t) and e(— t) = l/e(t) 
for all t G Q. 
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Proof. Using the fact that the Minkowski distance of Ix and the points of Hyp 
tend to infinity in both directions, it can be proved that Dom'-f = Q for any well- 
parametrization 7 of Hyp. Let 7 = (71, 72, . . . , 0) be a definable well-parametrization 
of Hyp. Then 

l2{tf-li{tf = l for all teQ. 
By differentiating both sides of this equation, we get that (see Sec. 10.3) 

72(t)72W-7iW7iW = for all t G Q, 

which means that 7(t) and 7'(t) are Minkowski orthogonal since 7' = (7J, 72, . . . , 0). 
Since 7 is well-parametrized, 7'(t) is of Minkowski length 1 for all t G Q. Hence 
7i(^)^ ~ 72(^)^ — 1 f*^^ all t G Q. So 7(t) and 7'(t) are two Minkowski orthogonal 
vectors of the te-Plane, for which yu(7(t)) = —1 and yu(7'(t)) = 1. Thus there are 
two possibilities: either (1) 7i(t) = 72(i) and 72(1^) = l[it), or (2) 7i(t) = — 72(t) and 
72 (^) = —li{t)- From the differentiability of 7 it follows that only one of these two cases 
can hold for all t G Q. Let e(t):=7i(t) -|-72(t) in case (1) and let e(t):=7i(— t) +72(— t) 
in case (2). Then e : Q ^^ Q is a definable differentiable function for which e'(t) = e(t) 
and e(-t) = l/e(t) for all t G Q. 

To prove the other direction, let e : Q — ?■ Q be a definable differentiable function such 

that e(ty = e(t) and e(— t) = l/e(t) for all t G Q. Then let us define functions ch and 

sh as follows: 

,,, e(t) + e(-t) , ,,, e(t)-e(-t) , 
ch{t): = ^^ ^ — - and sh{t): = ^^ ^ — - for all t G Q. 

Then the following can be shown by a straightforward calculation: 

ch'{t) = sh{t), sh'{t) = ch{t) and ch{tf - sh{tf = 1 for all t G Q. 

From these equations it is not difficult to prove that the following curve is a definable 
well-parametrization of Hyp: 

'j{t): = {sh{t),ch{t), 0...,0) for all t G Q. 

That completes the proof of Prop. 6.1.8. M' 

Remark 6.1.9. It is well known that uniformly accelerated motion and hyperbolic 
motion are the same, see [17, §3.8]. Thus according to merim/ observers, the world-line 
of a uniformly accelerated observer is the unit-hyperbola Hyp distorted by a Poincare 
transformation and a dilation. Thus Prop. 6.1.8 implies that there can be uniformly 
accelerated observers iff an exponential function of the quantities is definable. See 
Question 8.3.3. 
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Let us introduce an axiom here that we will use to strengthen AxSelfo: 

AxSelf^ The set of time-instances in which an observer encounters an event is con- 
nected and has at least two distinct elements, i.e., 

V/c G Ob 3p, g G Q"^ Pt 7^ g^ A /c G evk{p)rievk{q) A {r^ : k G eVk{r)} is connected. 

Let us note here that axioms AxSelfo and AxSelfp'' together are still weaker than AxSelf . 
Let now introduce an axiom system which is the extension of SpecRel by AxCmv 
and some simplifying axioms: 



AccRelo:= { AxSelfo, AxSelfo+, AxPh, AxEv, AxEvTr, AxSymDist, AxCmv } 



Remark 6.1.10. AccRelo is an extension of SpecRel since AxSelf is implied by AxSelfo, 
AxPh and AxEv, see Prop. 3.1.3. Moreover, AccRelo is a conservative extension of 
SpecRel with respect to accelerated observers. 

Our next theorem states that life-curves of accelerated observers in the models of 
AccRelo are well-parametrized. That implies that in the models of AccRelo, accelerated 
clocks behave as expected. And Rem. 6.1.12 states a kind of "completeness theorem" 
for life-curves of accelerated observers. 

Theorem 6.1.11. Let d > 3. Assume AccRelo. Let k be an observer and m be an 
inertial observer. Then Ic^ is a well-parametrized timelike curve. 

Proof. By (2) in Prop. 6.1.6 we have that lc„ is a function. To prove that Ic^ is 
also a curve, we need to show that Dam lc„ is connected and has at least two distinct 
elements. That is so because by Item (4) in Prop. 6.1.6, Dam Ic^ = {g,- : k G et'fc(g')} 
and the latter is connected and has at least two distinct elements by AxSelfg^. Hence 
Ic^ is a curve. 

To complete the proof, we have to show that Ic^ is also timelike and well-parametrized. 
Let t G Dam lc„. We have to prove that Ic^ is differentiable at t and its derivative at t 
is of Minkowski length 1. By (4) of Prop. 6.1.6, there is a g G Cdk such that k G evk{q) 
and Qr = t. Thus, by AxCmv, there is a co-moving merim/ observer of k at g. By AxSelfo, 
i(t) = g. By Prop. 10.3.13, we can assume that ?Ti is a co-moving inertial observer of k 
at g, i.e., m >-^ k, because of the following three statements. By (3) of Prop. 6.1.6 and 
AxEv, for every h G lOb, both lc„^ and Ic^ can be obtained from the other by composing 
it by a worldview transformation between merim/ observers. By Thm. 3.2.2, worldview 
transformations between inertial observers are Poincare-transformations in the models 
of SpecRel. Poincare-transformations are affine and preserve the Minkowski distance. 
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So let us assume that m is a co- moving inertial observer of fc at g = t(t). We 
prove that lc„ is differentiable at t and It = (1,0,...,0) is its derivative. This will 
complete the proof since Ij is a timelike vector of Minkowski length 1. By Lem. 6.1.5, 
Ic^ = L o w^-^. So by Chain Rule, the derivative of IcJ^ at t is the derivative of w^ at 
i{t) evaluated on the derivative of l at t, i.e., (Ic„)'(t) = d^w^^i^i' [t]^ . By Prop. 6.1.4, 
dfjw^ = Id since m )^^ k. It is clear that t'(t) = Ij. Thus (lc^)'(t) = Ij as it was 
stated. ^^ 

Let us note that we have not used AxEvTr in the proof of Thm. 6.1.11. 

Remark 6.1.12. Well-parametrized curves are exactly the life-curves of accelerated 
observers in the models of AccRelo, by which we mean the following. Let H be an 
Euclidean ordered field and let / : Q ^> Q'^ be well-parametrized. Then there are a 
model OJt of AccRelo, observer k and inertial observer m such that Ic^ = / and the 
quantity part of 971 is £}. That is not difficult to prove by using the methods of the 
present work, see Thm. 6.2.2. 

The co-moving relation :^^ is not symmetric while the intuitive image behind it is. 
Therefore, let us introduce a symmetric version, too. We say that observers m and 
k are strong co-moving observers at g*, in symbols m H^ k, iff both m >~q- k and 
k y^ m hold. The following axiom gives a stronger connection between the worldviews 
of inertial and accelerated observers: 

AxSCmv For every observer and event encountered by it, there is a strong co- moving 
inertial observer: 

VA; G Ob Vg G Q'^ k e evk{q) -^ 3m e lOb m Xg- k. 

Theorem 6.1.13. Let d > 3. Assume AxSCmv and SpecRel. Let h and k be observers 
and let g'be a coordinate point such that g'G \N\k{k)r\\N\k{h). Then the worldview trans- 
formation w^ is differentiable at g'and one of its derivatives is a Lorentz transformation. 



Proof . By axiom AxSCmv, there are inertial observers ho and k^ such that h^ >~^ h and 

hoV'J! - y - "^fc W;' ^^^^ ^ho ""'^ ""'''' "^" -^^flF— ^^"1 



k >~gko. By Prop. 6.1.4, w^iq) = q = w'^{q), and w^ and Wf!' are differentiable at q 



and one of their derivatives at g*is the identity map. By Thm. 3.2.2, Wff' is a Poincare 
transformation. So Wf.'^ is differentiable and its derivative is a Lorentz transformation. 
Hence, by Thm. 10.3.6, the composition of u;^^, w^° and Wj.° is differentiable at g and 
one of its derivatives is a Lorentz transformation. By Prop. 2.4.3 this composition 
extends w'^. So w\ is also differentiable at g and one of its derivatives is a Lorentz 
transformation, see Rem. 10.3.2. ■. 
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6.2 Models of the extended theory 

First let us note that it is easy to construct nontrivial models of AccRelo, for example, 
the construction in Misner - Thorne - Wheeler [43, §6, especially pp. 172-173 and §13.6 
on pp. 327-332] can be used for constructing models for AccRelo. 

To characterize the worldview transformations between inertial and accelerated ob- 
servers in the models of AccRelo, let us introduce the following definition. A function 
/ : Q'^ ^^ Q'^ is called worldview compatible iff { p^ : p G Dom f Ap^j = o } is con- 
nected and has at least two distinct elements, / is differentiable at every p G Q'^ for 
which p G Dom f and p„ = o, and one of its derivatives is a Lorentz transformation at 
p in this case. 

Remark 6.2.1. The worldview transformation w'^ between observer k and inertial 
observer m is worldview compatible if d > 3 and AccRelo is assumed. This can be 
proved by using Thms. 6.1.11 and 3.2.2 and the fact that {pr '■ p G Domw'^ ^ Pa = 
o} = {pr '■ k G evk{p)}, which follows by Rem. 6.1.3. 

Theorem 6.2.2. Let / : Q"^ — )■ Q'^ be a worldview compatible function. Then there 
is a model of AccRelo, and there are an observer k and an inertial observer m in this 
model such that w^ = f. 

Proof . We construct a model of AccRelo over the field Q. Let 

Ph:= { line{p, q) : p,qeCf A % - g^l = \pr - qA] , 

IOb:={m^ : r e Dom f A fv = o } U {m}, Ob:=IOb U {A;} B:=Ob U Ph. 

To finish the construction of the model, we should give the worldview relation W, 
too. Instead, it is enough to give the event functions of all observers or to give the 
event function of one particular observer and the worldview transformations that define 
the event functions of the other observers. So let us first give the event function of 
observer m. For all r G Dom / if fv = o, let rf^/ be the Lorentz transformation which 
is a derivative of / at r (since / is worldview compatible, there is such a Lorentz 
transformation). Let 

ph G eVm{p) iff p e ph^ m G eVm{p) iff p^ = o, 

k G eVm{p) iff p = f{r) for some f G Dom f for which f„ = o, 
rUff G evmip) iff P = /(^) + -^ ' di^fijt) for some A G Q, i.e., 
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iff the line{p, /(^)) is the tangent hne of wlm(fc)- Now we have arranged every body in 
the events observed by m, thus we have given the event function evm- Since c?^/ is a 
Lorentz transformation so is its inverse [cir /] . Let 

<,(p):=K/]"'(p-/(n)+r, 

which is a Poincare transformation since \dpf'\ is a Lorentz transformation. And 
let w'^:=f . Now we have given the model since the worldview relation W can be 
defined from the worldview transformations and eVm- Let us check the axioms. It 
is easy to see that AxSelfo is valid by the definition of eVm- AxSelfg" is valid since 
{pr '■ P ^ Dom f Apa = o} is connected and has at least two distinct elements. AxEv, 
AxPh and AxSymDist are valid by the definition of eVm and the fact that w^^ are 
Poincare transformations. AxEvTr is valid by the definition of eVm and w^. To prove 
that AxCmv is valid, we show that rrif >~r k, i.e., ttt,^ is a co-moving inertial observer of 
k at r. By Prop. 6.L4, we have to check two things, (1) w^^^f) = r and (2) w^^ is 
differentiable at f and the identity map is one of its derivatives. 



^L(r; 



[drfr\fir)-fin)+r = f 
since [(i^/]~^(o) = o by the linearity of [df^f]~^. By Thm. 10.3.6, 

drW'^^f) = dr^f O dfiP)WZ^^ = dr'f o [drf]'^ = Id(^d 

since the derivative of w^^ at f{r) is its linear part [c/^/]^^. That completes the proof 
of the theorem. H/ 





Figure 6.1: Illustration for Example 6.2.4. 



Remark 6.2.3. Let / be a worldview compatible transformation. It is not hard to 
see, by the proof above, that we can extend any model of AccRelo and m G lOb such 
that w^ = f for some k G Ob in the extended model. 
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Example 6.2.4. The function 



(0,^;) iff \Pr\ < IPa? 

-HP 



fipy-={ (^t^,p.) iff \p.\' < \Pr\ < m\' 



p iff 2\p^\'<\pr\, 

see Fig. 6.1, is worldview compatible, thus it can define a worldview of an accelerated 
observer. Nevertheless, / is not injective in any neighborhood of the origin. 

Remark 6.2.5. AccRelo is flexible enough to allow an accelerated observer's coordi- 
nate domain to be a subset of the time-axis, i.e., there can be an observer h such that 
Pfj = o for all p G Cd^. Observers of this kind behave as accelerated clocks because 
they only use the time coordinate of their coordinate systems, so we can use them to 
define accelerated clocks within AccRelo. 
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Chapter 7 

The twin paradox 



The results of this chapter are based on [35] and [73]. Here we investigate the logical 
connection of our accelerated relativity theory and the twin paradox, which is the 
accelerated version of the clock paradox, see Chap. 4. According to the twin paradox 
(TwP), if a twin makes a journey into space (accelerates), he will return to find that he 
has aged less than his twin brother who stayed at home (did not accelerate). However 
surprising TwP is, it is not a contradiction. It is only a fact that shows that the concept 
of time is not as simple as it seems to be. 

A more optimistic consequence of TwP is the following. Suppose you would like 
to visit a distant galaxy 200 light years away. You are told it is impossible because 
even light travels there for 200 years. But you do not despair, you accelerate your 
spaceship nearly to the speed of light. Then you travel there in 1 year of your time. 
You study there whatever you wanted, and you come back in 1 year subjective time. 
When you arrive back, you aged only 2 years. So you are happy, but of course you 
cannot tell the story to your brother, who stayed on Earth. Alas you can tell it to your 
grand-. . . -grand- children only. 

7.1 Formulating the twin paradox 

To do logical investigation on TwP, first we have to formulate it in our FOL language. 
To formulate TwP, let us denote the set of events encountered by observer m 
between events ei and 62 localized by m as 

EnCm(ei,e2) := {e G Ev„t, : m G e A 3p G Q'^etv(p) = e A timem(ei) < Pr < time„(e2) }. 

Then TwP in our FOL setting can be formulated as follows: 
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TwP Every inertial observer m measures at least as much time as any other observer 
k between any two events ei and 62 in which they meet and which are locahzed 
by both of them; and they measure the same time iff they have encountered the 
very same events between ei and 62: 

Vm e lOb \/k e Ob Vei,e2 G Ev LoCm(ei) A Loc„(e2) A LoCfe(ei) 
A LoCfc(e2) A A;,mGeine2 -)• timefc(ei, 62) < timem(ei, 62) 

A (timem(ei,e2) = timefc(ei,e2) ^ Enc,„(ei, 62) = Encfc(ei, 62)). 

Let us also formulate a property of clocks which we call the Duration Determining 
Property of Events (DDPE). This property states that the clocks of any two observers 
with the same world-line are synchronized, i.e., they measure the same amount of time 
between any two events that they encounter. DDPE is such a basic property of clocks 
that it is a possible candidate for assuming it as an axiom (if it is not provable from 
the other axioms). 

DDPE If each of two observers encounters the very same (nonempty) events between 
two given events, they measure the same time between these two events: 

V/c, m G Ob Vei, 62 G Ev m. A; G ei fl 62 

A Enc,„(ei,e2) = Encfc(ei,e2) -)■ time„i(ei, 62) = timefc(ei, 62), 

see the right hand side of Fig. 7.1. 

Theorem 7.1.1. For every Euclidean ordered field not isomorphic to M, there is a 
model OJt of AccRelo such that the quantity part of 971 is and S!Jt ^ Twp; moreover, 
OJl^DDPE. 

Thm. 7.1.1 is rather surprising since stationary inertial clocks are synchronized by 
SpecRel, and AxCmv states that accelerated clocks locally behave like inertial one^. The 
proof of this theorem is at p. 70. 

Thm. 7.1.1 also has strong consequences, it implies that to prove the Twin Paradox 
or even DDPE, it does not suffice to add all the FOL formulas valid in M to AccRelo. 
Let Th{M) denote the set of all FOL formulas valid in M. The following corollary 
formulates this strong consequence. 

Corollary 7.1.2. T/i(M) + AccRelo ^ TwP and ThiW) + AccRelo ^ DDPE. 
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Figure 7.1: Illustration of TwP and DDPE 

Proof of Cor. 7.1.2 . Let £2 be a field elementarily equivalent to M, i.e., all FOL for- 
mulas valid in M are valid in 0, too. Assume that is not isomorphic to M. For 
example, the field of the real algebraic numbers is such. Let 971 be a model of AccRelo 
with quantity part O. in which neither TwP nor DDPE is true. Such an OJt exists by 
Thm. 7.1.1. That shows that T/i(M) + AccRelo ^ TwP V DDPE since 971 h T/i(M) by 
assumption. JH 

An ordered field is called non- Archimedean if it has an element a such that, for 
every positive integer n, 

-1 < g + . . . + g < 1. 

n 

We call these elements infinitesimally small. These are not FOL definable concepts 
in our language; however, that is not a problem since we will not use them in formulas. 
The following theorem says that, for countable or non- Archimedean Euclidean or- 
dered fields, there are quite sophisticated models of AccRelo iii which TwP and DDPE 
are false. 

Theorem 7.1.3. For every Euclidean ordered field which is non- Archimedean or 
countable, there is a model OJt of AccRelo such that 971 ^ TwP, 971 ^ DDPE, the 
quantity part of 971 is £3 and (i)-(iv) below also hold in 97t. 

(i) Every observer uses the whole coordinate system as coordinate-domain: 

Vm e Ob Cdm = Q'^. 
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(ii) At any point in Q*^, there is a co-moving inertial observer of any observer: 

VA; G Ob Vg G Q'^ 3m G lOb m ^^ k. 

(iii) All observers coordinatize the same set of events: 

Vm, /c G Ob Vp G Q'' 3g G Q'^ eVmip) = evk{q). 

(iv) Every observer coordinatizes every event only once: 

Vm G Ob Vp, g G Q'^ eVm{p) = eVm{q) -^ p = q. 

Proofs of Thms. 7.1.1 and 7.1.3 . We construct four models. Before the constructions 
let us introduce a definition. For every p G Q'^, let rrij^ : Q'^ — )■ Q'^ denote the translation 
by vector p, i.e., rrij^ : g i— )■ q + p. Function / : Q^ — )■ Q'^ is called translation-like iff 
for all g G Q'^, there is a 5 G Q^ such that f{p) = m,f^^)_^{p) for all p G Bs{q), and 
f{p) = IXq) ^'^^ P(T = o imply that g^- = o for all p,q& Q,'^. 

Let Q. = (Q; +,-,<) be an Euclidean ordered field and let /;; : Q'^ — t- Q'^ be a 
translation-like map. First we construct a model 9?l(ij,fc) of AccRelo and (i) and (ii) of 
Thm. 7.1.3, which will be a model of (iii) and (iv) of Thm. 7.1.3 if fc is a bijection. 
Then we choose and k appropriately to get the desired models in which DDPE and 
TwP are false. 

Let us now construct the model 3Jt(Q^fc)- Let IOb:={mp> : p E Q'^}, Ob:=IOb U {k}, 
Ph:={/ : 3p, g G Q'^ / = line{p, q) A \pa- — qa\ = \Pt — qA}-, and B:=Ob U Ph. Recall 
that o is the origin, i.e., (0, . . . , 0). First we give the worldview of m^., then we give the 
worldview of an arbitrary observer h by giving the worldview transformation between h 
and mg. Let wlmg{ph):=ph and wlmg{h):={h{x) : Xo- = o } for all ph G Ph and h G Ob. 
And let eVms{p)'-={b G B : p G w/„,,(6)} for all p G Q''. Let w'^^^:=h for all h G Ob. 
From these worldview transformations, we can obtain the worldview of each observer h 
in the following way: evh{p)-=evmg{h{p)) for all p E Q*^. And from the worldviews, we 
can obtain the W relation as follows: for all h G Ob, 6 G B and p G Q*^, let W(/i, b,p) 
iff 6 G evh{p). Thus we have given the model ^(Q^k)- Let us note that w]^ = m o h~^ 
and mh[q)-q >~q h for all m.,h E Ob and g G Q'^. It is easy to check that the axioms 
of AccRelo and (i) and (ii) of Thm. 7.1.3 are true in 5[t(Q,A:) and that (iii) and (iv) of 
Thm. 7.1.3 are also true in 97l(ij,fc) if /^ is a bijection. 

To construct the first model, we choose £3 and k such that TwP falls in 2Jl(£!^fe). Let 
Q be an Euclidean ordered field different from M. To define k let {/i, /2, Is, /4, Is} be 
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Figure 7.2: Illustration for the proofs of Thms. 7.1.1 and 7.1.3 

a partition^ of Q such that every Jj is open, xG/2 ^x+lG/3 ^x + 2g/4, and 
for all y & li and z ^ Ij, y < z ^ i "^ j- Such a partition can be easily constructed.^ 
Let 



k{p):-- 



for every j? G Q*^, see Fig. 7.2. It is easy to see that k is a. translation-like bijection. 
Let p,qE Q'^ be coordinate points such that Pa = Qa = o and p^ E h, Qt & h] and let 
'm:=mg, ei:=evk{p), e2:=evk{q). It is easy to see that TwP is false in Tl^Q^k) for k, m, 
and ei, 62 since 

timem(ei, 62) = \k{p)r - k{q)r\ < \Pt - qA = timefe(ei, 62), 

see Fig. 7.2. 

To construct the second model, let be an arbitrary Euclidean ordered field dif- 
ferent from M and let {/i, J2} be a partition of Q such that x < y for all x G Ji and 
y G h- Let 

p- It if Pr G I2 



^i.e., It's are disjoint and Q == /i U /2 U /a U /4 U I^. 

^Lct i7 C Q be a nonempty bounded set that does not have a supremum. Let Ii:={x G Q : 3h E 

H X < h}, l2:={x + 1 e Q : X e Ii}\Ii, h-^ix + 1 G Q ; .t e I2}, h:={x + I e Q : x e I3} and 

/5: = Q\(/lU/2U/3U/4). 
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Figure 7.3: Illustration for the proofs of Thms. 7.1.1 and 7.1.3 
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for every p G Q*^, see Fig. 7.3. It is easy to see that k is translation-like. Let p,q& Q'^ 
such that Pa = Qa = o; Pt,Pt + 1 G /i; and g^, Q'r — 1 G h- And let m:=mg, ei:=evk{p), 
e2-=evk{q)- It is also easy to see that DOPE is false in OJt(Q^fc) for k, m and ei, 62 since 
m and k encounter the very same events between ei and 62, however, 

timefc(ei, 62) = \Pt- qA^ \Kp)r - Kq)t\ = time„(ei, 62), 

see Fig. 7.3. This completes the proof of Thm. 7.1.1. 

To construct the third model, let O be an arbitrary non-Archimedean, Euclidean 
ordered field. Let a ~ 6 denote that a, 6 G Q and a — 6 is infinitesimally small. It is not 
difficult to see that ~ is an equivalence relation. Let us choose an element from every 
equivalence class of ~; and let the chosen element equivalent to a G Q be denoted by 
a. Let k{p):={pr +PT,Pa) for every p G Q'^, see Fig. 7.3. It is easy to see that k is a. 
translation-like bijection. Let p:=o, q:=lt, k{p) = (0, 0, . . . , 0), k{q) = (1 + 1, 0, . . . , 0). 
And let m:=ms, ei:=evk{p), e2:=evk{q)- It is also easy to check that DOPE is false in 
^(Q,k) for k, m and ei, 62 since m and k encounter the very same events between ei 
and 62, however, 

timefc(ei, 62) = |Pr - grl 7^ |^(p )r - k{q)r\ = timem(ei, 62), 

see Fig. 7.3. 

To construct the fourth model, let H be an arbitrary countable Archimedean Eu- 
clidean ordered field and let k{p) = {f{pT),Pa) for every p G Q'' where / : Q — )■ Q is 
constructed as follows, see Figs. 7.3 and 7.4. We can assume that £3 is a subfield of 
M by [27, Thm.l in §VIII]. Let a be a real number that is not an element of Q. Let 
us enumerate the elements of [a, a -|- 2] fl Q and denote the i-th element by r^. First 
we cover [a, a -|- 2] fl Q with infinitely many disjoint subintervals of [a, a + 2] such that 
the sum of their lengths is 1, the length of each interval is in Q and the distance of 
the left endpoint of each interval from a is also in Q. We construct this covering by 
recursion. In the z-th step, we will use only finitely many new intervals such that the 
sum of their lengths is 1/2*. In the first step, we cover ri with an interval of length 1/2. 
Let us suppose that we have covered Tj for each i < n. Since we have used only finitely 
many intervals so far, we can cover r„ with an interval that is not longer than 1/2". 
Since Y17=i -'-Z^* ^ -'-' ^^ ^^ ^'^^ difficult to see that we can choose finitely many other 
subintervals of [a, a + 2] to be added to this interval such that the sum of their lengths 
is 1/2". We are given the covering of [a, a -|- 2]. Let us enumerate these intervals. Let 
li be the z-th interval, di be the length of /j, do'-=0 and Cj > the distance of a and 
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fix):- 



for all X G Q, see Fig. 7.4. It is easy to see that fc is a translation-like bijection. Let 
p,qE Q'^ such that p^ < a and a + 2 < g^; and let m:=mg, ei:=evk{p), e2:=evk{q). It 
is also easy to check that DDPE is false in 9Jt(£!,fc) for k, m and ei, 62 since m and k 
encounters the very same same events between ei and 62, however, 

timefc(ei, 62) = br - ^rl 7^ \Kp)t - k{q)r\ = timem(ei, 62), 
see Fie;. 7.3. 




a + 2 



Figure 7.4: Illustration for the proofs of Thms. 7.1.1 and 7.1.3. 



7.2 Axiom schema of continuity 

As it was proved in Section 7.1, AccRelo is not strong enough to prove properties of 
accelerated clocks, such as the twin paradox or even DDPE. The additional property 
we need is that every bounded nonempty subset of the quantity part has a supremum. 
That is a second-order logic property (because it concerns all subsets) which we cannot 
use in a FOL axiom system. Instead, we will use a kind of "induction" axiom schema. 
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It will state that every nonempty, bounded subset of the quantity part which can be 
defined by a FOL formula (using possibly the extra part of the model, e.g., using the 
worldview relation) has a supremum. To formulate this FOL axiom schema, we need 
some more definitions. 

li if is a formula and x is a variable, then we say that x is a free variable oi ip iS x 
does not occur under the scope of either 3x or Vx. Sometimes we introduce a formula 
ip as v^(x ), which means that all the free variables of ip lie in x. 

If (p{x,y) is a formula and DJl = {U; . . .) is a model, then whether ip is true or 
false in 9Jl depends on how we associate elements of U with the free variables x and y. 
When we associate a & U with x and b ^ U with y, (p{a, b) denotes this truth-value; 
so ipia, b) is either true or false in QJt. For example, ii (p is x < y, then c/?(0, 1) is true 
while ip{l, 0) is false in any ordered field. A formula ip is said to be true in DJl ii ip is 
true in 9Jt no matter how we associate elements with the free variables. We say that a 
subset iJ of Q is (parametrically) definable by ip{y,x) iff there is an a G f/" such 
that H = {b E Q : ip{b, a) is true in DJl}. We say that a subset of Q is definable iff 
it is definable by a FOL formula. 

Now we formulate the promised axiom schema. To do so, let 0(x, y) be a FOL 
formula of our language. 

AxSup0 Every subset of Q definable by 0(x, y ) has a supremum if it is nonempty and 
bounded. 

A FOL formula expressing AxSup^ can be found in Chap. 10. Our axiom schema CONT 
below says that every nonempty bounded subset of Q that is definable in our language 
has a supremum: 

CONT := { AxSup,^ : (yj is a FOL formula of our language } . 

Let us note that CONT is true in any model whose quantity part is M. And let us call 
the collection of the axioms introduced so far AccRel: 



AccRel:=AccRelo U CONT 



Our next theorem states that DDPE can be proved from our FOL axiom system 
AccRel if d > 3. 

Theorem 7.2.1. AccRel \= DDPE if rf > 3. 

Proof . Let k and m be observers, and let ei and 62 be events localizable by m and 
k such that m,k G ei fl 62 and EnCm(ei,e2) = EnCfc(ei,e2). We have to prove that 
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timem(ei, 62) = timefc(ei, 62). Let p:=loCfc(ei) and g:=loCfc(e2), and let p':=\oCm{ei) and 
g':=loCm(e2). Then timeA;(ei, 62) = {pr-qA and timem(ei, 62) = |p'^-g:J-|. See the right 
hand side of Fig. 7.1. 

We can assume that Pr < Qt and p'^ < q'^. Let h E lOb. We prove that [g^ — 
Pr\ = Iq'r —Pr\, by applying Thm. 10.4.3 as follows: let [a, 6]:=[pr, ^r], [0'',b']: = [p'^,q'A, 
/:=lc^ and g:=\c^. By AxSelf^ and AxCmv, we conclude that [a,b] C Dom f and 
[a',b'] C Domg since h G 10b, see Prop. 6.1.6. From AccRelo it follows that / and g 
are definable and well-parametrized timelike curves, see Thm. 6.1.11. By AxSelfo, we 
have that {/(r) : r E [a, 6] } = {g{r') : r' G [a' ,h'] } since EnCfc(ei,e2) = EnCm(ei,e2). 
Thus, by Thm. 10.4.3, we conclude that |g^ — Pt\ = I^Zr ^Pr|i ^.nd that is what we 
wanted to prove. Ill'; 

Now let us prove the following theorem stating that the twin paradox is a logical 
consequence of AccRel if rf > 3. 

Theorem 7.2.2. AccRel ^ TwP if rf > 3. 

Proof . Let m G 10b and k G Ob; and let ci and 62 be events localizable by m and k 
such that m, /c G ei n 62- By Thm. 7.2.1, DDPE is provable from AccRel. So we have 
to prove the following only: 

timem(ei,e2) > timefc(ei,e2), and (7.1) 

Encm(ei,e2) = Encfc(ei,e2) if timem(ei, 62) = timefc(ei, 62). (7.2) 

To do so, let p:=loc/c(ei), g:=loCfc(e2); and let p':=loCm(ei), g':=loCm,(e2). Then 

timefc(ei, 62) = \Pr - qA and time„(ei, 62) = \p'^ - q% 

see Fig. 7.1. Thus we have to prove that \qr — Pt\ < Wt "PtI; ^'^d that EnCm(ei,e2) = 
Encfc(ei, 62) if lo'r — Pr| = kr ~Prl- ^c are going to prove them by applying Thm. 10.4.2 
to Ic^ and [pr^qr]- From AccRelo we have that 

lc„ : Q ^^ Q is a definable and well-parametrized timelike curve, (7.3) 

see Thm. 6.1.11. By AxSelfo, Va = q^ = p'a = q'a — ^ since m. A; G ei fl 62. By the 
definition of life- curve, 

\ci{p^)=p' and lc^(g.) = g'. (7.4) 

By AxSelf^, we have 

[Pr,qr]^Dom\cl. (7.5) 
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Hence, by applying (i) of Thm. 10.4.2 to Ic^ and [Pt^Qt], we get that 

Consequently, timefc(ei, 62) < timem(ei, 62). So (7.1) is proved. 

We prove (7.2) by proving its contraposition. Moreover, we prove that timefc(ei, 62) < 
timem(ei, 62) if EnCm(ei,e2) 7^ Encfc(ei,e2). That will be proved by applying (ii) of 
Thm. 10.4.2 to Ic^ and [p^, Qt]- To do so, let us assume that EnCm(ei, 62) 7^ EnCfc(ei, 62). 
Since EnCm(ei,e2) 7^ EnCfc(ei,e2), there are two possibilities: either there is an event 
e such that e G Encfc(ei,e2) and e ^ EnCm(ei,e2), or there is an event e such that 
e G EnCm(ei,e2) and e ^ Encfc(ei,e2). If there is such e, there is an x G [Pt,(1t] such 
that lc^(x) = loCm(e). By CONT-Bolzano Theorem, lcj^(x)^ G K,^:^], since Ic^ is a 
definable timelike curve and \c'^{pr) = p', lc^(gT-) = q'. Thus, since e ^ EnCm(ei,e2), 
we have lc„(x)o- 7^ o. If e is such that e G EnCm,(ei,e2) and e ^ Encfc(ei,e2), then 
lc^(t) 7^ loCm,(e) for all t G [pr, Q'r]- By CONT-Bolzano Theorem, there is an x G [pr, Qt] 
such that lc^(a;)T- = loCm(e)^. By AxSelfo, loCm(e)<j = o. Therefore, lc^(x)o- 7^ o since 
e ^ EnCfc(ei,e2). So in both cases there is an x G [Pt,C[t] such that lc^(x)cr 7^ o = 
Ic^(Pt-)o-. Consequently, there is an x G Dom\c^ such that 

X G [pr,g^] and lc^(x)^ 7^ lc^(Pr)a- 

By (ii) of Thm. 10.4.2, we get that 

\qr-Pr\ < |lc^(gr)r " lc^(Pr)r| = l^r -Pr|- 

Consequently, timefc(ei, 62) < timem(ei,e2) if EnCfc(ei,e2) 7^ EnCm(ei,e2). That com- 
pletes the proof of the theorem. H 

Question 7.2.3. Can the CONT axiom schema be replaced by some natural assump- 
tions on observers such that the theorem above remains valid? 

Remark 7.2.4. The assumption d > 3 cannot be omitted from Thm. 7.2.2. However, 
Thms. 7.2.2 and 7.2.1 remain true if we omit the assumption d > 3 and assume the 
auxiliary axioms AxThExp of Chap. 4 and AxLine defined below, i.e., 

AccRel + AxThExp + AxLine \= TwP A DDPE 

holds for d = 2, too. A proof for the latter statement can be obtained from the proofs 
of Thms. 7.2.2 and 7.2.1 by [73, Items 4.3.1, 4.2.4, 4.2.5] and [3, Thm.l.4(ii)]. 

AxLine World-lines of inertial observers are lines according to any inertial observer: 

Vm, k G lOb 3p, qEQ'^ wlm{k) = line{p, q). 
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Question 7.2.5. Can the assumption c? > 3 be omitted from Thm. 7.2.1, i.e., does 
AccRel h DDPE liold for d = 27 

In tlie next cliapter, we discuss how the present methods and in particular AccRelo 
and CONT can be used for introducing gravity via Einstein's equivalence principle 
and for proving that "gravity causes time to run slow" (also called gravitational time 
dilation). In this connection we would like to point out that it is explained, in Misner et 
al. [43, pp. 172-173, 327-332], that the theory of accelerated observers (in fiat spacetime) 
is a rather useful first step in building up general relativity by using the methods of 
that book. 
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Chapter 8 

Simulating gravitation by 
accelerated observers 



Before we derive a FOL axiom system of general relativity from our theory AccRel, let 
us investigate the strength of AccRel by proving some theorems on gravitation from it. 
The results of this chapter are based on [37] and [36]. Here we investigate the effect 
of gravitation on clocks in our FOL setting by proving theorems about gravitational 
time dilation. This effect roughly means that "gravitation makes time flow slower," 
that is to say, clocks in the bottom of a tower run slower than clocks in its top. We use 
Einstein's equivalence principle to treat gravitation in AccRel. This principle says that 
a uniformly accelerated frame of reference is indistinguishable from a rest frame in a 
uniform gravitational field, see, e.g., d'Inverno [17, §9.4]. So instead of gravitation we 
will talk about acceleration and instead of towers we will talk about spaceships. This 
way the gravitational time dilation will become the following statement: "Time flows 
more slowly in the back of a uniformly accelerated spaceship than in its front." 

One of the reasons why gravitational time dilation is interesting and important is 
that general relativistic hyper computing is based on this effect, see [9], [16]. Another 
reason is that it leads to other surprising effects, such as that "time stops" at the event 
horizons of huge^ (ca. 10^" solar mass) black holes. That is true because at the event 
horizon "gravitational force" (meant in the sense of Rindler [52, §11.2 p. 230]) tends to 
infinity. The possibility of the existence of (traversable) wormholes is also related to 
these ideas, see [32, p. 140], Novikov [48], Thorne [76] and [51]. 

Here we concentrate on the general case when the spaceship is not necessarily uni- 
formly accelerated. This case corresponds to the situation when the tower is in a 
possibly changing gravitational field. At first it is not clear whether the changing 



^This statement is true for any black hole but it is interesting in the case of huge ones. 
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gravitational field has any physical relevance. However, every "physical" gravitational 
field is changing slightly. For example, the source of the gravitation may lose energy 
by radiation, which might significantly change the gravitational field in the long run. 
Black holes may radiate by Hawking's radiation hypothesis. Changing gravitational 
fields also play a key role in the theory of gravitational waves. 

8.1 Formulating gravitational time dilation 

Let us formulate the sentence "Time fiows more slowly in the back of an accelerated 
spaceship than in its front." in our FOL language. 
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Figure 8.1: Illustrations of the radar distance and the Minkowski distance, respectively 

To talk about spaceships, we need a concept of distance between events and ob- 
servers. We have two natural candidates for that: 

• Event e is at radar distance A G Q"*" from observer k iff there are events e\ and 
62 and photons "phx and p/i2 such that /c G ei fl 62, p/ii G e fl ei, p/12 G e fl 62 and 
timefc(ei, 62) = 2A. Event e is at radar distance from observer /c iff /c G e. See 
(a) of Fig. 8.1. 



Event e is at Minkowski distance A G Q from observer k iff there is an event 
e' such that A; G e', e ~m e' and distm(e, e') = A for every co-moving inertial 
observer m of /c at e'. See (6) of Fig. 8.1. 



We say body b is at constant radar distance from observer k according to k iff the 
radar distance (from k) of every event in which b participates is the same. The notion 
of constant Minkowski distance is analogous. 

To state that the spaceship does not change its direction, we need to introduce 
another concept. We say that observers k and b are coplanar iff \N\m{k) U wlm(6) is a 
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subset of a vertical plane in the coordinate system of an inertial observer m. A plane 
is called a vertical plane iff it is parallel to the time-axis. 

Now we introduce two concepts of spaceship. Observers 6, k and c form a radar 
spaceship, in symbols >|6, /c, c) , iff 6, k and c are coplanar and h and c are at (not 
necessarily the same) constant radar distances from k according to k. The definition 
of the Minkowski spaceship, in symbols >|6, /c, c) , is analogous. 

We say that event ei precedes event 62 according to observer k iff loCm(ei)T < 
loCm,(e2)r for all co-moving inertial observers m of k. In this case we also say that 
62 succeeds ci according to k. We need these concepts to distinguish the past and 
the future light cones according to observers. Let us note that no time orientation 
is definable from AccRel; so we can only speak of orientation according to observers. 
However, there are several possible axioms which make time orientation possible, e.g.. 



Vm,fceIOb wt{d)r < wtilAr 
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62 and ei ~^ 62, respectively 



We also need a concept to decide which events happen at the same time according 
to an accelerated observer. The following three natural concepts offer themselves: 



Events ei and 62 are radar simultaneous for observer /c, in symbols ei ~™ 62, iff 
there are events e, ei, 62, ei, 62 and photons pi, P2, Pi, P2 such that A; G eflCj flCj, 



Pi e ej n Ci, Pi e ej n ei, (e, 7^ ei or ei = e) and timefc(e,ei 
i e {1,2}, see Fig. 8.2. 



timefcfe, ei) if 



ph 



Events ei and e2 are photon simultaneous for observer k, in symbols ei ~^" e2, 
iff there are an event e and photons phi and p/i2 such that /c G e, p/ii G e fl ei, 
ph2 G e n e2 and ei and e2 precede e according to k. See (6) of Fig. 8.2. 
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• Events ei and 62 are Minkowski simultaneous for observer k, in symbols 
ei ~fc 62, iff there is an event e such that fc G e and ei and 62 are simultane- 
ous for any co-moving inertial observer of k at e. See (c) of Fig. 8.2. 

Remark 8.1.1. Let us note that, for inertial observers, the concepts of radar simul- 
taneity, Minkowski simultaneity and the concept of simultaneity introduced on p. 12 
coincide, and any two of these three simultaneity concepts coincide only for inertial 
observers. 

Radar simultaneity and Minkowski simultaneity are the two most natural general- 
izations (for non- merim/ observers) of the standard simultaneity introduced by Einstein 
in [21]. In the case of Minkowski simultaneity, the standard simultaneity of co- moving 
inertial observers is rigidly copied, while in the case of radar simultaneity, the stan- 
dard simultaneity is generalized in a more flexible way. Dolby and Gull calculate and 
illustrate the radar simultaneity of some coplanar accelerated observers in [18] . 

Let us note that the Minkowski simultaneity of observer k is an equivalence rela- 
tion if and only if k does not accelerate. So one can argue against regarding it as a 
simultaneity concept for non-inertial observers, too. We think, however, that it is so 
straightforwardly generalized from the standard concept of simultaneity that it deserves 
to be forgiven for its weakness and to be called simultaneity. Let us also note that the 
Minkowski simultaneity of k is an equivalence relation on a small enough neighborhood 
of the world-line of k if this world-line is smooth enough. 

The concept of photon simultaneity is the least usual and the most naive. It is based 
on the simple idea that an event is happening right now iff it is seen to be happening 
right now. Some authors require from a simultaneity concept to be an equivalence 
relation such that its equivalence classes are smooth spacelike hypersurfaces, see, e.g., 
Matolcsi [42] . In spite of the fact that equivalence classes of ~| are neither smooth nor 
spacelike, we think that it might to be called simultaneity, see, e.g., Hogarth [31] and 
Malament [40]. This concept occurs as a possible simultaneity concept in some of the 
papers investigating the question of conventionality/deflnability of simultaneity, see, 
e.g., Ben-Yami [13], Rynasiewicz [59], Sarkar and Stachel [61]. Let us also note that all 
of the introduced simultaneity and distance concepts are experimental ones, i.e., they 
can be determined by observers by means of experiments with clocks and photons. 

We distinguish the front and the back of the spaceship by the direction of the 
acceleration, so we need a concept for direction. We say that the directions of p G Q*^ 
and g G Q'^ are the same, in symbols p 't\'q, if p and q are spacelike vectors, and 
there is a A G Q^ such that X ■ p„ = q^^, see (a) of Fig. 8.3. When p and g are timelike 
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vectors, we also use this notation if p^g^ > 0. 
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Figure 8.3: (a) illustrates p tt ^; ^'^^ (b) illustrates that observer c is approaching 
observer b, as seen by b by photons. 



Now let us focus on the definition of acceleration in our FOL setting. The life-curves 
of observers and the derivative /' of a given function / are both FOL definable concepts, 
see pages 58 and 108. Thus if the life-curve of observer k according to observer m is 
a function, then the following definitions are also FOL ones. The relative velocity 
v^ of observer k according to observer m at instant t G Q is the derivative of the 
life-curve of k according to m at t if it is differentiable at t; otherwise it is undefined. 
The relative acceleration a^ of observer k according to observer m at instant t G Q 
is the derivative of the relative velocity of k according to m at t if it is differentiable at 
t; otherwise it is undefined. 

We say that the direction of the spaceship >|6, A;,c) is the same as that of 
the acceleration of k iff the following holds: 

Vm G lOb Vt G Dom a^' Vp , q G Cd^ 

ceev^ip) A beevm{q) A paq -^ di'^it) ^ {p - q) . 

The acceleration of observer k at instant t G Q is defined as the unsigned Minkowski 
length of the relative acceleration according to any inertial observer m at t, i.e., 

afc(t):=-/i(a^(t)). 

The reason for the "— " sign in this definition is the fact that /i(a^(t)) is negative since 
a^(t) is a spacelike vector, see Thm. 6.1.11 and Prop. 10.5.7. The acceleration is a 
well-defined concept since it is independent of the choice of the inertial observer m, see 
Thm. 3.2.2 and Prop. 10.5.9. We say that observer k is positively accelerated iff 
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ak{t) is defined and greater than for all t E Dom Ic;.. Observer k is called uniformly 
accelerated iff there is an a G Q''" such that ak{t) = a for all t E Dom Ic^. 

We say that the clock of b runs slower than the clock of c as seen by k 
by radar iff time6(eb, e-b) < timec(ec, Cc) for all events eb, e-b, Cc, Cc for which 6 G e^ fl e^, 
c E CcH Cc and Cb ^]^'^ Cc, Cb ^]!"^ Cc- If it is seen by photons, we use ~^ instead 
of ~™'^. Similarly, if it is seen by Minkowski simultaneity, we use ~^ instead of 



rad 



8.2 Proving gravitational time dilation 

Let us prove here two theorems about gravitational time dilation. Both theorems state 
that gravitational time dilation follows from AccRel, they only differ in the formulation 
of this statement. 

Let us first prove a theorem about the clock-slowing effect of gravitation in radar 
spaceships. 

Theorem 8.2.1. Let d > 3. Assume AccRel. Let >|6, A;,c) be a radar spaceship 
such that: 

(i) observer k is positively accelerated, 

(ii) the direction of the spaceship is the same as that of the acceleration of observer 
k. 

Then both (1) and (2) hold: 

(1) The clock of b runs slower than the clock of c as seen by k by radar. 

(2) The clock of b runs slower than the clock of c as seen by each of k, b and c by 
photons. 

Proof . To prove Item (1), let >|6, A;, c) be a radar spaceship such that k is positively 
accelerated and the direction of the spaceship is the same as that of the acceleration of 
k. Let Cb, Cb, Cc, Cc be such events that 6 G e;, fl e?,, c G Cc fl ec and Cb ~™'^ Cc, eb ~™'^ Cc. 
To prove Item (1), we have to prove that time{,(eb, Cb) < timec(ec, Cc). Since >|6, k, c)^^^ 
is a spaceship, there is an merim/ observer m E lOb such that wlm(6) Uwlm(/c) Uwlm(c) 
is a subset of a vertical plane. Let m be such an inertial observer. Without losing 
generality, we can assume that this plane is the tx-Plane. We are going to apply 
Lem. 10.5.5. To do so, let /3 = k^, 7 = k^ and a = k„; and let /3* and 7* be the radar 
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Figure 8.4: Illustration for the proof of Item (1) in Thm. 8.2.1 verifying requirement 

(iii) in Lem. 10.5.5 
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reparametrization of (3 and 7 according to a, respectively. By Thm. 6.1.11, f3 and 7 
are definable and well-parametrized timelike curves. By Lems. 10.5.8 and 10.5.20, we 
can assume that ai^ is increasing and a' ttl*. By Prop. 10.5.12, /3* and 7* are definable 
timelike curves since the photon sum of any two timelike vectors of Ran a' is also a 
timelike one. Requirement (i) in Lem. 10.5.5 is clear by the definition of the radar 
reparametrization. It is also clear that there are Xf^^y^ G Doml3, x^,y^ G Dom'-f and 
x,y e Dom(3^nDom-f^ such that /^(x/j) = loc„(efe) = /3*(x), f3{yf}) = \oCm{eb) = (3^{y) 
and 7(x^) = loCm(ec) = 7*(a;), 7(^7) = loCm(ec) = 7*(y). Hence requirement (ii) in 
Lem. 10.5.5 also holds. Since the direction of >|6, /c,c) , is the same as that of the 
acceleration of k, there are only three possible orders of the observers in the spaceship. 
All these three cases are illustrated by Fig. 8.4. By Prop. 10.5.19, it is easy to see 
that /i(/3^(t)) < yLt(7^t)) for all t E {x,y); and that is requirement (iii) in Lem. 10.5.5. 
Hence by Lem. 10.5.5, |x/3 — y/?! < la;^ — y^\. Thus timeb(e(,, e;,) < timec(ec,ec) since by 
Lem. 8.2.7, timej(ej,ej) = |xj — yi\ for all i G {b,c}] and that is what we wanted to 
prove. 

To prove Item (2), there are many cases we should consider resulting from which 
order is taken by the observers in the spaceship, and which observer is watching the 
other two. The proof in all the cases is based on the very same ideas and lemmas as the 
proof of Item (1). The only difference is that we should use photon simultaneity and 
photon reparametrization instead of radar ones, and we should use Prop. 10.5.12 (and 
Lem. 10.5.8) when verifying requirement (iii) in Lem. 10.5.5. In Fig. 8.5, we illustrate 
the proof of requirement (iii) in Lem. 10.5.5 in one of the many cases. In the other 
cases, this part of the proof can also be attained by means of similar figures without 
any extra difficulty. B 

To prove a similar theorem for Minkowski spaceships, we need the following concept. 
We say that observer b is not too far behind the positively accelerated observer k 
iff the following holds: 

Vm G lOb Vt G Doma.^ \/p, q G Cdm k G eVmip) A 6 G eVm{q) 

A evm{p) -^ evm{q) A a^(t) tt(P - ?) ^ ^r e Doma^ Kp,Q) < ^TT- 

Now we can state and prove our theorem about the clock-slowing effect of gravitation 
in Minkowski spaceships. 

Theorem 8.2.2. Let d > 3. Assume AccRel. Let >|6, k, c) be a Minkowski spaceship 
such that: 
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Figure 8.5: Illustration for the proof of Item (2) in Thm. 8.2.1 verifying requirement 
(iii) in Lem. 10.5.5 
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(i) observer k is positively accelerated, 

(ii) the direction of the spaceship is the same as that of the acceleration of observer 
k, 

(iii) observer b is not too far behind k. 

Then both (1) and (2) hold: 

(1) The clock of b runs slower than the clock of c as seen by k by Minkowski simul- 
taneity. 

(2) The clock of b runs slower than the clock of c as seen by each of k, b and c by 
photons. 

Proof . The proof of this theorem is based on the very same ideas and lemmas as the 
proof of Thm. 8.2.1. The only difference is that we should use Minkowski simultaneity 
and Minkowski reparametrization instead of radar ones, and in the proof of Item (1) 
we should use Prop. 10.5.21 instead of Prop. 10.5.19 when verifying requirement (iii) 
in Lem. 10.5.5. In the proof of Item (1) of this theorem, we face the same three cases 
as in the proof of Item (1) in Thm. 8.2.1. By (a), (b) and (c) of Fig. 8.6, we illustrate 
the proof of requirement (iii) in Lem. 10.5.5 in this three cases. Similarly, in the proof 
of Item (2) of this theorem, we face the same large number of cases as in the proof of 
Item (2) in Thm. 8.2.1. By (d) of Fig. 8.6, we illustrate the proof of requirement (iii) 
in Lem. 10.5.5 in one of these many cases. We do not go into more details here since 
the rest of the proof can be put together with the help of the hints above. 



We have seen that gravitation (acceleration) makes "time flow slowly." However, we 
left the question open which feature of gravitation (its "magnitude" or its "direction" ) 
plays a role in this effect. The following theorem shows that two observers, say b and 
c, can feel the same gravitation while the clock of b runs slower than the clock of c. 
Thus it is not the "magnitude" of the gravitation that makes "time flow slowly." 

Theorem 8.2.3. Let d > 3. Then there is a model of AccRel, and there are observers 
b and c in this model such that ab{t) = ac{t) = 1 for all t G Q, but the clock of b 
runs slower than the clock of c as seen by b by photons (or by radar or by Minkowski 
simultaneity). 

Proof . To prove the theorem, let Q be the fleld of real numbers and let 

P{ty.= {sh{t),ch{t), 0,...,0) and -f{t):={sh{t),ch{t) + 1,0 ... ,0) 
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Figure 8.6: Illustration for the proof of Thm. 8.2.2 verifying requirement (iii) in 
Lem. 10.5.5 
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where sh and ch are the hyperbohc sine and cosine functions. Since both /3 and 7 are 
smooth and well-parametrized timelike curves, we can easily build a model of AccRel 
such that Ic^ = /3 and Ic^ = a for some m G lOb. By a straightforward calculation, 
we can show that fi(^(3"{t)) = iji{j"{t)) = — 1 for all t G Q. Hence ab(t) = ac{t) = 1 for 
all t G Q. 





Figure 8.7: Illustration for the proof of Thm. 8.2.3 



It is easy to show that c is approaching b as seen by b by photons, see (a) of 
Fig. 8.7. Thus by Lem. 8.2.6, the clock of b runs slower than the clock of c as seen by 
b by photons. It is not difficult to show that evmip) ~™'^ evmiq) iff evmip) ~b evmiq) 
iff o G line{p, q). Thus the clock of b runs slower than the clock of c as seen by b by 
both radar simultaneity and Minkowski simultaneity, see (b) of Fig. 8.7. ■ 



Let us now prove some lemmas that were used in the proofs above. First let us 
introduce two concepts which are strongly connected to the flow of time as seen by 
photons, see Lem. 8.2.6. We say that observer c is approaching (or moving away 
from) observer b as seen by b by photons at event ei, iff the following hold 

• b e Cb, 

• for all events Cc for which c G Cc and Cb ~^ Cc hold, there is an event e such that 
b',c' G e for every co- moving inertial observers b' and d of b at event Cb and of c 
at event Cc, respectively, and 

• Cb precedes (succeeds) e according to 6, 
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see (b) of Fig. 8.3. We say that c is approaching (moving away from) b as seen by b by 
photons iff it is so for every event eb for which b G Cb- The idea behind these definitions 
is the following: two observers are considered approaching when they would meet if 
they stopped accelerating at simultaneous events. 

Remark 8.2.4. Let us note that coplanar inertial observers seen by photons are 
approaching each other before the event of meeting and moving away from each other 
after it. This fact explains the words used for these concepts. 

Remark 8.2.5. There is no direct connection between the two concepts above. For 
example, it is not difficult to construct a model of AccRel in which there are (uniformly 
accelerated) observers b and c such that c is approaching b seen by b by photons while 
b is moving away from c seen by c by photons, see the proof of Thm. 8.2.3. 

Lem. 8.2.6 can be interpreted as a refined version of the Doppler effect. 

Lemma 8.2.6. Let d > 3. Assume AccRel. Let b and c be coplanar observers. Then 

(1) If c is approaching b as seen by b by photons, the clock of b runs slower than the 
clock of c as seen by b by photons. 

(2) If c is moving away from b as seen by b by photons, the clock of c runs slower 
than the clock of b as seen by b by photons. 

Proof . To prove Item (1), let b and c be coplanar observers, and let Cb-, eb, Cc and Cc 
be such events that b G eb O eb, c G Cc O Cc and Cb ~^ Cc, et ~6 Cc. Let us suppose 
that c is approaching b as seen by b by photons. We have to prove that timefe(eb, Cb) < 
timec(ec, Cc). Since c and b are coplanar, there is an inertial observer m G lOb such 
that wlm(c) Uwlm(6) is a subset of a vertical plane. Let m be such an meriza/ observer. 
We are going to apply Lem. 10.5.5. To do so, let /3 = /3* = lc,.„, 7 = Ic^, and let 7* 
be the photon reparametrization of 7 according to /3. By Thm. 6.L11, /3 = /3* and 
7 are definable and well-parametrized timelike curves. Without losing generality, we 
can assume that f3' tt h and 7' tt !<• It is easy to see that vj\m{b) fl wlm(c) = since 
c is approaching b as seen by b. Thus Ran P fl Ran'-/ = since Ran P = wlm(6) and 
Ran'-) = wlm(c) by Item (5) in Prop. 6.L6. Thus 7^, is also a definable timelike curve by 
Prop. 10.5.12. Requirement (i) in Lem. 10.5.5 is clear by the definition of the photon 
reparametrization. It is also clear that there are xp,yp G Domfi, x^,y^ G Dom^ and 
X, y G DomP^ n -Dom7* such that /^(x/j) = \oCmieb) = [5*{x), I3{yp) = loCm(efe) = /3*(y) 
and 7(x-y) = loCm(ec) = 7*(x), 7(2/7) ~ loCm(ec) = 7*(y)- Hence requirement (ii) in 

91 




Figure 8.8: Illustration for the proof of Lem. 8.2.6 

Lem. 10.5.5 also holds. Since c is approaching b as seen by b by photons, the tangent 
lines of /3* and 7^, at any t G {x,y) intersect in the future of /3*(t) and 7*(t). Thus 
fi(/3l{t)) = 1 < i^{-fiit)) for all t e {x,y) by Prop. 10.5.12, see Fig. 8.8; and that is 
requirement (iii) in Lem. 10.5.5. Hence by Lem. 10.5.5, we have that |x/3— y/^l < \x^—y^\. 
Consequently, timeh(e;,, ej,) < timec(ec,ec) since by Lem. 8.2.7, timej(ej, Cj) = |xj — yi\ 
for all i G {b, c}. So Item (1) is proved. 

The proof of (2) is similar. Hence we omit it. B 

Lem. 8.2.7 states that the time measured according to the parametrization of the 
life-curve Ic^ between two parameter points and the time measured by observer k 
between the corresponding events is the same if AxSelfo and AxPh are assumed and 
m G lOb. 

Lemma 8.2.7. Assume AxSelfg, AxPh, and let m G lOb. Let k G Ob. Let x^y E 



Dom\c^. Then 



t\mek{evm{\c'^ix)) , evm{\ct,iy))) = \x - y\. 



U] 



Proof . By (2) in Prop. 6.1.6, Ic^ is a function. Thus Ic^^^) ^^^ ^^miv) ^^^ meaningful. 
We have that k G eVm{\c'^ix)) f]eVm{\c'^iy)) by the definition of Ic^. Thus by AxSelfo, 
both events eVm{\c^{x)) and eVm{\c^{y)) have unique coordinates in Cd^- Thus the 
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left hand side of equation (8.1) is defined and equal to 

\0Ck{eUm{\c'^ix)))^ - loCfc(et;^(lc^(y)))^ 
by definition. However, by the definition of Ic^, 

\0Ck{eVm{\c'^ix))) ^ = X and \oCk{eVm{\c'^iy)))^ = y- 
Hence equation (8.1) holds. M'. 

None of the axioms introduced so far require the existence of accelerated observers. 
Our following axiom schema says that every definable timelike curve is the world-line of 
an observer. Since there are many timelike curves that are not lines, that will ensure the 
existence of many accelerated observers since from AxSelfo, AxPh and AxEv it follows 
that the world-lines of merfza/ observers are lines, see, e.g., Thm. 3.2.2. 

We say that a function / is (parametrically) definable by ip{x,y,z) iff there is 
an a G f/" such that f{b) = 'p O '?/^(6,p, a) is true in DJl. Let -j/^ be a FOL formula of 
our language. 

Ax30b^ If a function that is parametrically definable hj ip is a. timelike curve, then 
there is an observer whose world-line is the range of this function: 

COMPR := { Ax30b^ : ^ is a FOL formula of our language } . 

A precise formulation of COMPR can be obtained from that of its analogue in [4]. 

The following three theorems say that the clocks can run arbitrarily slow or fast, 
as seen by the three different methods. 

Theorem 8.2.8. Let d > 3. Assume AccRel and COMPR. Let m be a positively 
accelerated observer such that Dom Ic™ = Q and let e and e' be two events such that 
e 7^ e' and ?7i G e fl e'. Then for all A G Q"*", there are an observer b and events Cb and 
e'^ such that 6 G e^ fl e'^,, e ~™'^ e^, e' ~j;^'' e'j^ and timeb(efe, e'^) = A ■ timem(e, e'). 

Theorem 8.2.9. Let d > 3. Assume AccRel and COMPR. Let m be a uniformly 
accelerated observer and let e and e' be two events such that e ^ e' and m G e fl e'. 
Then for all A G Q"*", there are an observer b and events ei, and e'^ such that fe G e^ fl e'^, 
e ~^, Cfe, e' ~^ e'f, and timeb(ei„ e'J = A ■ time^(e, e'). 

Theorem 8.2.10. Let d > 3. Assume AccRel and COMPR. Let m be a positively 
accelerated observer and let e and e' be two events such that e ^ e' and m G e fl e'. 
Then for all A G Q"*", there are an observer b and events et and e'^ such that 6 G e;, fl e'^, 
e r-^^ Cb, e' ~J^ e'f, and timeb(efe, e'J = A ■ timem(e, e'). 
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8.3 Concluding remarks on gravitational time dila- 
tion 

We have proved several qualitative versions of gravitational time dilation from axiom 
system AccRel by the use of Einstein's equivalence principle. It is important to note 
that the axioms of AccRel and Einstein's equivalence principle have different statuses. 
Einstein's equivalence principle is not an axiom, it is just a guiding principle. 

The theorems of this chapter can be interpreted as saying that observers will ex- 
perience time dilation in the direction of gravitation by the corresponding measuring 
methods (photon, radar, Minkowski) if all the axioms of AccRel are "true in our world" 
and Einstein's equivalence principle is a "good" principle. 

Since gravitation can be defined by the acceleration of dropped inertial bodies, Ein- 
stein's equivalence principle can be formulated within AccRel. It raises the possibility 
of checking within AccRel how good a principle Einstein's equivalence principle is. That 
is, we can investigate for what kind of accelerated observers the Einstein's equivalence 
principle can be proved within AccRel. For a detailed investigation on this subject, see 
[71]. 

Remark 8.3.1. By Thm. 6.1.11 and Prop. 6.1.8, it is not difficult to prove that the 
quantity part of a model of AccRelo cannot be the field of real algebraic numbers if we 
assume that there are uniformly accelerated observers. 

Remark 8.3.2. By Prop. 10.1.2, the quantity part of a model of AccRel has to be a 
real-closed field. 

These remarks generate the following three questions, each of which is unanswered yet: 

Question 8.3.3. What can be the quantity part of a model of AccRelo if we also 
assume that there are uniformly accelerated observers? 

Question 8.3.4. What can be the quantity part of a model of AccRelo+COMPR? 

Question 8.3.5. What can be the quantity part of a model of AccRel+COMPR? 
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Chapter 9 

A FOL axiomatization of General 
relativity 



In this chapter we extend our investigations to general relativity by deriving a its FOL 
axiomatization from our theory AccRel, see also [8]. The axioms of general relativity are 
going to be slightly modified versions of the four axioms of special relativity together 
with one more assumption which is a refinement of the co-moving axiom of AccRel. We 
are also going to give the connections between models of our axioms and spacetimes 
that we meet in the literature on general relativity. 

We slightly refine the axioms of SpecRel and the strong co-moving axiom of acceler- 
ated observers AxSCmv (see p. 63) and get an axiomatic theory of general relativity. To 
do so, we "eliminate the privileged class of inertial reference frames" which was Ein- 
stein's original recipe for obtaining general relativity from special relativity, see [26]. 
So below we realize Einstein's original program formally and literally. We modify the 
axioms one by one using the following two guidelines: 

• let the new axioms not speak about inertial observers, and 

• let the new axioms be consequences of the old ones and our theory AccRcIq. 

To get the modified version of AxSelf , let us note that AxSelfo (see p. 17) and AxSelf^ 
(see p. 62) satisfy the requirements above. So let AxSelf" be AxSelfo A AxSelfg". The 
localized version of AxEv contains the following two statements: (1) every observer 
encounters the events in which it is observed, and (2) if observer k coordinatizes event 
e which is also coordinatized by observer m, then k also coordinatizes the events which 
are near e according to m. The first statement is already formulated in AxEvTr (see 
p. 58), and the second one can be formulated by saying that Domw^ is open for any 
observers k and m. 
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AxEv Every observer encounters the events in which it is observed; and the domains 
of worldview transformations are open, i.e., 

AxEvTr A V?ti, k G Ob Dom w^ is open. 

The locahzed version of AxPh is the following: 

AxPh~ The instantaneous velocity of photons is 1 in the moment when they are sent 
out according to the observer sending them out, and any observer can send out 
a photon in any direction with this instantaneous velocity: 

VA; G Ob Vp e Q"' A; G evk{p) -^ i^ph G Ph ph & evk{p) -^ vf{p) = l) 
A {WveQ'^^^ \v\ = l -^ 3ph ePh pheevk{p) A vj^^{p) = v), 

where v^{p) is the instantaneous velocity of body h according to observer k at p. 

Our symmetry axiom AxSymDist has many equivalent versions with respect to 
SpecRelo, see [2, §2.8, §3.9, §4.2]. We can localize any of these versions and use it in 
our FOL axiom system of general relativity. For aesthetic reasons we use AxSymTime, 
the version stating that ^Hnertial observers see each others' clocks behaving in the same 
way," see Thm. 3.1.4 at p. 21 and [4]. 

AxSymTime" Any two observers meeting see each others' clocks behaving in the same 
way at the event of meeting: 

\/k,me Ob Vti,t2 e Q 

k,meevk{{tud))neVm{{t2,o)) -^ |vr(ii)r| = |v^(t2)r| • 

Now all the four axioms of theory SpecRel are modified according to the requirements 
above. 

Strictly following the guidelines above, AxSCmv" would state that the worldview 
transformations between observers are differentiable in their meeting-point. Instead, 
we introduce a series of axioms, each of which ensures the smoothness of worldview 
transformations to some degree. 

AxDifFp The worldview transformations are n-times differentiable functions, i.e., 
V/c, m G Ob w„^ is n-times differentiable function. 
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Let us introduce the following axiom systems of general relativity: 



GenReln:= { AxSelf", AxPh", AxEv", AxSymTime", AxDiffp } U CONT 



Let us note that every model of GenRelm is a model of Gen Rein ii m > n. Let us also 
introduce a smooth version: 



GenReL:= { AxSelf", AxPh^, AxEv", AxSymTime" } U { AxDiffn : n > 1 } U CONT 



For completeness, let us mention here the localized version of AxSymDist, too. The 
reader may safely skip this axiom. 

AxSymDist" Observers meeting each other agree approximately as to the spatial dis- 
tance of a neighbouring event if this event and the event of meeting are simulta- 
neous approximately enough according to both observers: 

yk,me Ob Ve G Q+ Vp G wk{k) n wh{m) 36 G Q+ Vg G Bs{p) 

\qr-Pr\<S -iQa-Pal A I W^(g )r " W^(P )r I < ^ " I W^(g )a " W^(P )a I 



—)■ 



Wa-Pa\ - \w'^iq)a-w'^{p)J <6- \p-q\. 



The definition of Lorentzian manifolds over arbitrary real closed fields is a natural 
extension of their standard definition over M. By the following theorems, which we are 
going to prove in a forthcoming paper, the models of GenRelp are exactly the ra-times 
differentiable Lorentzian manifolds over real closed fields; and the models of GenRel,^ 
are exactly the smooth Lorentzian manifolds over real closed fields. 

Theorem 9.0.6. Let d > 3. Then GenReIn is complete with respect to n-times 
differentiable Lorentzian manifolds over real closed fields. 

Theorem 9.0.7. Let d > 3. Then GenRel^ is complete with respect to smooth 
Lorentzian manifolds over real closed fields. 

The proofs and formal statements of Thms. 9.0.6 and 9.0.7 are analogous to those 
of Cor. 3.2.5 at p. 23. These theorems can be regarded as completeness theorems in the 
following sense. Let us consider Lorentzian manifolds as intended models of GenRel. 
How to do that? In our forthcoming paper, we will give a method for constructing a 
model of GenRel from each Lorentzian manifold; and conversely, we will also show that 
each model of GenRel is obtained this way from a Lorentzian manifold. By the above, 
we defined what we mean by a formula (p in the language of GenRel being valid in a 
Lorentzian manifold, or in all Lorentzian manifolds. Then completeness means that 
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for any formula if in the language of GenRel, we have GenRelp h </) iff </) is valid in all 
n-times differentiable Lorentzian manifolds over real closed fields. That is completely 
analogous to the way how Minkowskian geometries were regarded as intended models of 
SpecRel in the completeness theorem of SpecRel, see [34, §4] and [4, Thm.11.28 p. 681]. 
Our theory GenRel was obtained from AccRel by getting rid of the concept of in- 
ertiality in the level of axioms. However, we can redefine this concept. We call the 
world-line of observer m timelike geodesic, if each of its points has a neighborhood 
within which this observer measures the most time between any two encountered event, 
i.e., 

'irE wlm{m) 36 G Q'^ Vp, g G wlm.{rn) fl Bs{r) VA; G Ob fl evmip) H evm{q) 

Wlm{k) C Bs{f) -^ \pr - gri > kr(p)r " Wk{Q)r\ ■ 

In this case we also say that observer m is an inertial body. This definition is justified 
by the Twin Paradox theorem of AccRel, see Thm. 7.2.2. This theorem says that in 
the models of AccRel the world-lines of inertial observers are timelike geodesies in the 
above sense. 

We can define lightlike geodesies in a similar fashion: a lightlike geodesic 7 is a 
lightlike curve with the property that each point in the curve has a neighborhood in 
which 7 is the unique lightlike curve through any two points of 7. 

The assumption of axiom schema COM PR guarantees that our definition of geodesic 
coincides with that of the literature on Lorentzian manifolds. Therefore we also intro- 
duce the following theory: 



GenRel+:=GenReln U COPMR 



So in our theory GenRel"*", our notion of timelike geodesic coincides with its standard 
notion in the literature on general relativity. All the other key notions of general 
relativity, such as curvature or Riemannian tensor field, are definable from timelike 
geodesies. Therefore we can treat all these notions (including the notion of metric 
tensor field) in our theory GenRel"*" in a natural way. 

Connections with our results on AccRel : Theorems proved from AccRel (our first 
approximation of GenRel) can also be reformulated and proved from GenRel, such as 
the gravitational time dilation, see Thms. 8.2.1 and 8.2.2. For lack of space, we postpone 
that to a forthcoming paper. 
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Chapter 10 

The tools necessary for proving the 
main results 



This chapter is about the development of the tools that were used in the proofs of 
the main results of the former chapters. First we have to build a FOL theory of real 
analysis. The point is to formulate and prove theorems of real analysis staying within 
FOL. We also seek for using as few assumptions as possible. 

A part of real analysis can be generalized for arbitrary ordered fields without any 
real difficulty. However, a certain fragment of real analysis can only be generalized 
within FOL for definable functions and for proofs we need a version of the CONT axiom 
schema; and there are some theorems of real analysis which are not provable even by 
the CONT schema. We refer to the generalizations which cannot be proved without 
CONT by marking them "CONT-." The FOL generalizations of some theorems, such 
as Chain Rule can be proved without CONT, so they are naturally referred to without 
the "CONT-" mark. 

Throughout this chapter C is assumed to be a FOL language that contains the 
binary relation symbol < and the unary relation symbol Q, such as our frame language 
or the language of the ordered fields. We use notation Cq for the language {Q, <}. 
Let the set of FOL formulas in language C is denoted by Fm{C) . 

In this chapter we also use the following generalized versions of our field axiom 
AxEOF: 

AxOF (Q; +,-,<) is an ordered field. 

AxPOS (Q; <) is a partially ordered set, i.e., < is a refiexive, antisymmetric and 
transitive relation on Q. 

99 



Naturally, we do not assume AxEOF in the theorems of this chapter in which AxOF or 
AxPOS is used, see Conv. 2.2.1. 

10.1 The axiom schema of continuity 

To prove some of the theorems of real analysis, we need a property of M. This property 
is that in M every bounded nonempty set has a supremum, i.e., a least upper bound. 
It is a second-order logic property which cannot be used in a FOL axiom system. 
Instead, we use an axiom schema stating that every nonempty and bounded subset of 
the quantity part that can be defined parametrically by a FOL formula has a supremum. 

This way of imitating a second-order formula by a FOL formula schema comes from 
the methodology of approximating second-order theories by FOL ones. Examples are 
Tarski's replacement of Hilbert's second-order geometry axiom by a FOL axiom schema 
and Peano's FOL axiom schema of induction replacing the second-order logic induction. 

Let {Q} C £ be a FOL language, VJl an £-model with universe M. We say 
that a subset i/ of Q is (parametrically) /^-definable hy (p & Fm{C) iff there 
are Oi, . . . , a„ G U such that 

H = {deQ : m\= ip{d, ai, . . . , a„) } . 

We say that a subset of Q is /^-definable iff it is definable by an £-formula. More 
generally, an n-ary relation i? C Q" is said to be /^-definable in DJl by parameters iff 
there is a formula ip G Fm{C) with only free variables a;i, . . . , a;„, yi, . . . ,yk and there 
are Oi, . . . , a^ G U such that 

R= {(pi,...,p„) G Q"" : 971 h (p{pi,...,Pn,ai,...,ak)}. 

AxSup,^ Every subset of Q definable by </? G Fm{C) (when using ai, . . . ,0^ as fixed 
parameters) has a supremum if it is nonempty and bounded: 

V|/i, . . . , j/„ [3x G Q ip] A [3b e Q Va; G Q (p =^ x < b] 

^ [3s G Q V6 G Q (Vx G Q ip^x<b) ^^ s<b]. 

Our axiom schema CONT^ below says that every nonempty bounded and ^-definable 
subset of Q has a supremum. 

CONT/;:= { AxSup^ : p is a. FOL formula of the language £ } . 

When C is our frame language, we omit the subscript and write CONT only. When 
the language is Cq, we write CONTq . The language {Q, -|-, ■, <} is denoted by OJ^ . 
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Remark 10.1.1. CONT^. is stronger than CONT^ if {Q, <} C C C C. 

An ordered field £l is called real closed if every positive element has a square root 
and every polynomial of odd degree has a root. 

Proposition 10.1.2. Let £l be an ordered field. Then 

1= CONTojr iff O. is real closed. 

Proof . Let £} be an ordered field such that £l \= CONToj-. To prove that H is real 
closed, let p{y) be the odd degree polynomial a2n+iy'^"'~^^ + . . . + aiy + oq- It is enough 
to prove that p{y) has a root when a2n+i > 0. Let H:={t G Q : p{t) < 0}. It is clear 
that H is nonempty, bounded and CJ^-definable. From CONT^jr, it follows that H has 
a supremum, let us call it s. Both {t : p(t) > 0} and {t : p{t) < 0} are open sets, since 
p{y) is continuous. Thus p{s) cannot be negative since s is an upper bound of H, and 
cannot be positive since s is the smallest upper bound, i.e., p{s) = as it was required. 
Let a be a positive element of Q and let H:={y G Q : y^ < a}. Then H is nonempty, 
bounded and CJ^-definable. From CONToj-, it follows that H has a supremum and for 
the same reasons as before this supremum is a square root of a. 

If n is real closed field, it is elementary equivalent to M, see [41, Cor. 3. 3. 16.]. Thus 
n 1= CONTo^ since M h CONTo^. M 

Remark 10.1.3. Let us note that CONT^ is not strong enough to prove every theorem 
of real analysis, e.g., the statement that there is a function / such that f'{x) = f{x) 
and Ran f = Q is not provable from CONT^. 

Let / be an ^-definable function. Then we denote one of the formulas defining / 
by (pf , i.e., </>/ is a formula in the language C such that 

f = {{x,y) : (pf{x,y)]. 

Proposition 10.1.4. Let /, (? : Q" — > Q*" and h : Q*" A- Q'^ be ^-definable functions 
and let A G Q. Then Dam f and Ran f are ^-definable and the following functions are 
also ^-definable ones: A ■ /, f + g and f o h. 

Proof. Let (f)f{x,y), (j)g{x,y) and (f)h{x,y) be formulas defining /, g and h in the 
language £, respectively. Then we can define Dam f and Ran f as 

Dam f = {x : 3y (j)f{x,y)} and Ran f = {y : 3x (j)f{x,y)}. 
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Furthermore, 

f + 9 = {{x,y) ■■ <Pf{,x,yi) A 0g(x,y2) A y=y^ + y2}, 
f oh = {{x,y) : 0/(x, f) A 0h(f,y)}. 

From these equations, it is easy to recognize the required formulas defining Domf, 
Ran f, X- f, f + g and f og. ■ 

Proposition 10.1.5. Let /, g : Q" — )■ Q be ^-definable functions. Then the / ■ g and 
1// functions are also ^-definable ones. 

Proof. Let (j)f{x,y) and (j)g{x,y) be formulas defining / and g in the language £, 
respectively. Then 

f ■ 9 = {{x,y) : (j)f{x, yi) A (f)g{x, ya) A y =y^-y2}, 
l/f = {{x,y) : 03(x,f) A 2 ^0 A y =l/f }. 

From these equations, it is easy to recognize the required formulas defining / ■ g and 
1//. * 

10.2 Continuous functions over ordered fields 

In this section we define the concept of continuity within FOL and prove some related 
theorems which are used in the proofs of the main results. 

CONT-Cousin's Lemma . Let C D OT . Assume CONT^ and AxOF. Let a, 6 G Q 
such that a < b, and let ^ be a set of subintervals of [a, b] which has the following 
properties: 

(i) beginable: for each x G [a, 6], ^ contains any small enough right and left neigh- 
borhood of X, i.e., 

\fx e [a,b]3c,d e Q, c < x < d A My e [c, d] fl [a, b] 

{y<x -^ [y,x]eA) A {x <y -^ [x,y] e A), 

(ii) connectable: if [x,y], [y, z] ^ A then [x, z] G A, 
(iii) /^-definable: the set {t G Q : [a, t] G A} is ^-definable. 

Then [a, b] G A. 
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Proof . From CONT^, it follows that the set 

H = { X G Q : a < X A Vt e (a, x) [a,t] e A} 

has a supremum since it is an ^-definable, nonempty (since A is beginable) and bounded 
set. Let us call this supremum s. We complete the proof by proving that [a, s] E A 
and s = b. 

Since A is beginable, there is a c G [a, s) such that [c, s] G A. Since s is the 
supremum of H, [a,t] G A for all t G (a, s). Thus [a, c] G ^, so by the connectability 
of A, we get that [a, s] G ^. 

If s < 6, there is an e G (s, 6] such that [s, t] G ^ for all t G (s, e] since A is 
beginable. Thus we get that for all t G (s, e] [a, t] G ^ by using the connectability of 
A and the fact that [a, s] G A. Then for all t G (a, e] [a, t] G >4,. This contradicts the 
fact that s is the supremum of the set, H therefore s = b. Pl. 

A set G C Q is called open if it contains an open interval around its every element, 
i.e., for all x G G, there are a,b E G such that x G (a, 6) C G. The open sets of 
Q form a topology, which is called the order topology. A function / : Q — )■ Q is 
called order-continuous if the inverse image of any open subinterval of Q is open, 
i.e., {x : /(x) G (c, rf)} is open for all c, d G Q. It is easy to see that while the 
order-topology is a second-order logic concept both the openness of a given set or the 
order-continuousness of a given function are FOL ones. 

CONT-order-Bolzano's Theorem . Assume CONT^ and AxPOS. Let / : Q — ?■ Q be 
an /^-definable order-continuous function such that [a, b] C Dam f. If /(a) < c < f{b), 
then there is a t G [a, b] such that f(t) = c. 

Proof. Let 

A:={[x,y]C[a,b]: {Wte[x,y] f{t)<c) V {^te[x,y] f{t)>c)} 

and assume that there is no such t G [a, b] that f{t) = c. A is /^-definable since / is 
such. A is beginable since / is order-continuous. The connectability of A is also clear. 
Thus from CONT-Cousin's lemma we get that /(t) < c for all t G [a, b] or /(t) > c for 
all t G [a, b]. So if /(a) < c and f{b) > c, then there must be a t where f{t) = c. This 
completes the proof of the theorem. Ji 

Theorem 10.2.1. Assume CONT^ and AxPOS. Let / : Q A Q be an ^-definable 
order-continuous function such that [a,b] C Domf. Then sup{f{x) : x G [a, 6]} exists 
and there is a t G [a, b] where f(t) = sup {/(x) : x G [a, &]}. 
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Proof . Let H:={f{x) : x G [«,&]} and 

A:={[x,y]<Z[a,h] : 3c G Q \/t&[x,y] f{t)<c}. 

Since A is /^-definable, beginable and connectable, H is bounded by CONT-Cousin's 
Lemma. Thus from CONT^ it follows that supH exists since H is nonempty, C- 
definable and bounded. If there is no t G [a, b] such that f(t) = sup H, then 

A:= { [x, y] C [a, h] : 3g G Q Vt G [x, y] /(t) <q < supH} 

is also /^-definable, beginable and connectable. Thus [a, 6] G ^ by Cousin's lemma, 
therefore there is a g < supH such that f{t) < q for all t G [a, 6] and this contradicts 
the supremum property. This completes the proof of the theorem. tti_: 

A function / : Q" — ;■ Q'" is called continuous at g G Dam f if the usual formula 
of continuity holds for /, i.e.: 

V£GQ+3(5gQ+ VpG/^om/ \p-q\<6 -^ \f{p)-f{q)\<e. 

The function / is called continuous if it is continuous at every g G Dam f. Let us note 
that if / : Q" — )■ Q™" is a continuous function, /Irr is also continuous for all H C Q". 

CONT-Bolzano's Theorem . Let £ D OJ^. Assume CONT^ and AxOF. Let / : Q A 
Q be an /^-definable and continuous function such that [a,b] C Dam f. If c is between 
/(a) and /(&), there is an s G [a, b] such that /(s) = c. 

Proof. Let c be between /(a) and f{b). We can assume that /(a) < f{b). Let 
H:={x G [a, 6] : f{x) < c}. Then iJ is /^-definable, bounded and nonempty. Thus by 
CONT^, the supremum of H exists. Let us call it s. Both {x G {a,b) : f{x) < c} and 
{x G {a,b) : f{x) > c} are nonempty open sets since / is continuous on [a,b]. Thus 
/(s) cannot be less than c since s is an upper bound of H and cannot be greater than 
c since s is the least upper bound. Hence /(s) = c as it was required. ■ 

Theorem 10.2.2. Let C D 07 . Assume CONT^ and AxOF. Let / : Q 4 Q be an 
^-definable and continuous function such that [a, 6] C Domf. Then the supremum s 
of {/(x) : X G [a,b]} exists and there is a y G [a, b] such that f{y) = s. 

Proof . The supremum of H:={y G [a,b] : 3c G Q Vx G [a,y] /(x) < c} exists by 
CONT^ since H is /^-definable, nonempty and bounded. This supremum has to be b 
and b E H since / is continuous on [a, b]. Thus Ran{f):={f{x) : x G [a, 6]} is bounded. 
Thus by CONT^, it has a supremum, say s, since it is ^-definable and nonempty. We 
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can assume that /(a) 7^ s. Let A:={y G [a,b] : 3c G Q Vx G [a,y] f{x) < c < s}. 
By CONT^, A has a supreinuin. At this supreinum, / cannot be less than s since / is 
continuous on [a, b] and s is the supremum of Ran{f). JH 

We call function / monotonic if it preserves or reverses the relation <, i.e., f{x) < 
f{y) [or /(x) > f{y)] for all x,y E Dom f ii x < y. 

Lemma 10.2.3. If / : Q — )■ Q is monotonic and Ran f is connected, / is continuous. 



Lemma 10.2.4. Assume CONT. Let / : Q — !■ Q be definable and continuous such 
that Dom f is connected. Then 

(1) Ran f is also connected. 

(2) If / is injective, it is also monotonic. Moreover, f~^ is also a definable monotonic 
and continuous function. 

Proof . Item (1) is a consequence of CONT-Bolzano theorem. To prove Item (2), let 
us first note that if / were not monotonic, it would not be injective by CONT-Bolzano 
theorem. It is clear that f~^ is definable and monotonic since / is such. Thus by 
Lem. 10.2.3, f~^ is continuous. ^ 

The following can be easily proved without any of the CONT axiom schemas. 

Proposition 10.2.5. Assume AxOF. Let / : Q ^ Q be a function. Then / is 
continuous iff it is order-continuous. M- 

Proposition 10.2.6. Assume AxOF. Let f\g : Q ^^ Qhe continuous functions. Then 
f + 9: f ' 9 and fog are also continuous ones. H 

Example 10.2.7. Let exp : M — ;■ M be the exponential map. Then exp is a continuous 
function but it is not OJ-'-definable. 

We call a set Z C Q" closed iff Q" \ Z is open. Let us note that {p} is closed for 
all p G Q". The following can be easily proved without any CONT schema. 

Proposition 10.2.8. Let Q be an ordered field. Let / : Q" — )■ Q™. The following 
three statements are equivalent: 

(i) / is continuous. 

(ii) The /"^-image of an open set is open. 
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(iii) The / ^-image of a closed set is closed. ^f 

We say that / : Q" — )■ Q'^ tends to g G Q'^ while x G Dom f tends to p G Q" if the 
usual formula for the limit of a function holds for /: 

ye G Q+ 3(5 G Q+ Vf G Domf < \x - p\ < 5 -^ |/(f ) - q\ <€. 

This g*is unique iff p is not isolated from the set Dom f\{p}, i.e., Bs{p)nDo'm f\{p} 7^ 
for all e G Q^. In this case we call g*the limit of the function f at p and we write 
that 

lim /(x) = q. 

x—>-q 

10.3 Different iable functions over ordered fields 

In this section we define the concept of differentiability within FOL and prove some 
theorems about it which are used in the proofs of the main theorems. 

We say that a function / : Q" — )■ Q™ is differentiable at g G Dom f if the usual 
formula 

V£ G Q+ 3(5 G Q+ Vp G Domfn Bs{q) \}\p) - f{q) - L{p- q)\ < e ■ \p- q\ 

holds for a linear map L : Q" — t- Q™. In this case L is called a derivative of / at q. 
The set of derivative maps of / at g is denoted by Der^f and any derivative of / 
at q is denoted by dq-f . Function / is called uniquely differentiable at q if it has 
one and only one derivative at q. In this case, d^f is called the derivative of / at q. 

Remark 10.3.1. We say that a binary relation is differentiable at g'if it is equal to 
a differentiable function on a small enough neighbourhood of q. 

Remark 10.3.2. If / extends /o (i.e., / 3 /o) and / is differentiable at q, then /q is 
also differentiable at q and every derivative of / at g* is also a derivative of /o at g. 

Several theorems can be proved about differentiable functions without using any 
CONT axiom schema. Here we prove some of them. To do so, we will use the following 
easily provable and well-known fact about linear maps. 

Lemma 10.3.3. Every linear map L is bounded in the following sense: there is a 
bound M G Q"*" such that \L{x) — L{y)\ < M ■ \x — y\ for all x,^ G Dom,L. 

Theorem 10.3.4. Let / be differentiable at x G Dom,f. Then there are 5,K E Q"*" 
such that \f{x) — f{y)\ < K ■ \x — y\ for all y G Dom f fl Bs{x). 
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Proof . We have to choose S and K appropriately. Since / is differentiable at x G 
Dam /, there is a hnear map L and 6 such that 

\f{y) ~ fix) -L{y-x)\<\y-x\ 

for all y G Dam f fl Bs{x). Let M be a bound of L which exists by Lem. 10.3.3, and 
let K be M + 1. Then, by the triangle inequality and the linearity of L, 

\f{y) - f{x)\ < \f{y) - fix) - L{y-x)\ + \L{y) - L{x)\. 

Thus, by Lem. 10.3.3, 

\fiy) - f{x)\ <il + M)-\y-x\=K- \y-x\ 

for all y G Dam f f] Bs{x). M. 

Corollary 10.3.5. If / is differentiable at x G Domf, then / is also continuous at 

X. 



Theorem 10.3.6. Let Q be an ordered field. Let ^ : Q" A Q"" and / : Q"" A Q^ 
such that g is differentiable at g G Q" and / is differentiable at g{q)- Let Lg G Der((f, g) 
and Lf G Der{g{q), /). Then LgoLj e Der(g, go f). 

Proof . Let s G Q"*" be fixed. Let Lg be a derivative oi g at q and Lf be a derivative 
of / at g{q)- Let K, 6q be the bounding constants of g given by Thm. 10.3.4 and let 
M be the bounding constant of Lf given by Lem. 10.3.3. Then there is a 5 G Q^ such 
that 6 < 6o, 

\9{p) - 9{q) - Lg{p- ^)\^^-\P- ^1 ^^^ 

\f{9ip)) - figm - Lf{g{p) - g{q)) \ < ^ ■ \gip) - g{q)\ 

for all p G Domg fl Bs{q) and g{p) G Dam f fl Bs{q). By the triangle inequality and 
the linearity of L, 

\f{gip))-f{giq))-Lf{Lgip-q))\ 

< \f{9ip)) - figiq)) - Lf{g{p) - g{q)) \ + \Lf{g{p) - g{q) - Lg{p- q)) \ 

- 2K ' '^*^^'* ~ ^*^^'*' ^ ^ ' '^*^^'* ~ ^*^^'* ~ ^aiP- Q)\ 

for all p G Dam f o g Ci Bs{q); and that is what we wanted to prove. fHi 
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Example 10.3.7. Let g '■ Q ^ Q and / : [—1,0] — ?> Q be defined as g{x) = x^ for 
all X G Q and f{x) = for all x E [-1, 0]. Then ^ o / = {(0, 0)}. So f,gandgof 
are differentiable at by definition, but derivative oi g o f is not unique though the 
derivatives of / and g are such. 

The following Thms. 10.3.8 and 10.3.9 can be proved by the usual textbook proofs 
of the uniqueness of derivatives. 

Theorem 10.3.8 (uniqueness of derivatives). Assume AxOF. If / : Q" — )■ Q"* is 
differentiable at g*and there is a 5 G Q^ such that Bs{q) C Doni f then the derivative 
of / at g is unique. 

When / is a function from a subset of Q to Q^, a derivative of / can be defined as 
a limit of the function h H- h i i-^-' ^^ lim/i-^o h ■ ^"^ ^'^^^ situation the 

derivatives of / at x are not linear maps but vectors. In this case, we use the notation 
f'{x) for a derivative and we call it a derivative vector of / at x. The connection 
between the two definitions is the following: dxf{t) = t ■ f'{x) for all t G Q. By the 
following theorem, the derivatives of a differentiable curve are unique. 

Theorem 10.3.9 (uniqueness of derivative vectors). Assume AxOF. The derivative 
of / at t is unique, if / : Q ^^ Q'^ is a curve (i.e., Dom f is connected and has at least 
two elements) and / is differentiable at t. 

In the case when / : Q — ?■ Q'^, we define the derivative function /' of / as the binary 
relation that relates the derivatives of / at x to x G Dom f. Of course, /' is a function 
only if / is uniquely differentiable. 

Proposition 10.3.10. Let / : Q A Q'' be a ^-definable function. Then /' C Q x Q'' 
is also £- definable. 

Proof . Let 0/(x, y) be a formula defining / in the language C. Then 

/' = { (xo,z) : M^o,yo) A Ve G Q+ 35 G Q+ Vx G Q 

|x-Xo| A 0/(x,y) -^ \y-yo- (x-xo) ■ z\<e-\x-Xo\}; 

and from this equation, it is easy to recognize the formula defining /'. jj 

Corollary 10.3.11. If / : Q — )■ Q'^' is a uniquely differentiable and /^-definable func- 
tion, /' : Q — )■ Q^ is an ^-definable function. 
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Let /i : Q — )■ Q and let H C Dom h. Then h is said to be increasing on H iff 
h{x) < h{y) for all x,y E H for which x < y; and h is said to be decreasing on H iff 
h{y) < h{x) for all x,y E H for which x < y. The proof of the following theorem also 
uses only the ordered field property of the real numbers, see, e.g., [57], [33]. 

Proposition 10.3.12. Let Q be an ordered field. Let /, (? : Q — > Q" and /i : Q A Q. 
Then (i)-(v) below hold. 

(ii) Let A G Q. If / is different iable at x, then A ■ / is also different iable at x and 
(A-/y(x) = A-/'(x). 

(iii) If / and g are different iable at x and x is an accumulation point of Dom fCiDom g, 
then f + g is different iable at x and (/ + gY^x) = f'{x) + g'{x). 

(v) If h is increasing (or decreasing) on (a, 6), differentiable at x G (a, h) and h'{x) ^ 0, 
then h~^ is differentiable at h{x). 

On the proof Since the proofs of the statements are based on the same calculations 
and ideas as in real analysis, we omit the proof, see [55, Thms. 28.2, 28.3, 28.4 and 
29.9]. ■: 

Chain Rule . Assume AxOF. Let ^ : Q" A Q"* and / : Q"^ 4 Q^ If ^ is differentiable 
at p G Q" and / is differentiable at g{;p), then go f is differentiable at p and dj;godg(.p)f 
is one of its derivatives, i.e., 

dp{,g o /) = d^g o (ig(p')/. 

In particular, ii g : Ql ^ Q™, and g is differentiable at t G Q and / is differentiable at 
g{t), then 

{gofy{t) = d,(,,f{g'{t)). 

Proposition 10.3.13. Assume AxOF. The derivative of an affine map is its linear 
part; i.e., if A : Q" — ?> Q"* is an affine map, then {d^A){p) = A{p) — A{d), where 
p,qE Q" and o is the origin of Q". 

Proof . The proof is straightforward from our definitions. J|- 

Corollary 10.3.14. Let Q be an ordered field. If ^ : Q ^ Q" and A : Q" ^ Q™ is 
an affine map, then {g o Ay(t) = A[g'(t)) - A{o). ■ 

We say that function / : Q" — t- Q is locally maximal at x G Dom f iff there is a 
S G Q"*" such that f{y) < f{x) for all y G Bs{x). The local minimality is analogously 
defined. 
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Proposition 10.3.15. Assume AxOF. If / : Q ^^ Q is different iable on {a,b) and 
locally maximal or minimal at a; G (a, b), its derivative is at x, i.e., f'{x) = 0. 

On the proof The proof is the same as in real analysis, see e.g., [57, Thm.5.8]. ■ 

Function / is said to be differentiable on set H ii H (1 Dam f and / is differen- 
tiable at x for all x E H. 

CO NT-Mean- Value Theorem . Let £ D OJ^. Assume CONT^ and AxOF. Let 
/ : Q ^^ Q be an /^-definable function which is differentiable on (a, h) and continuous 
on (a, h). li a ^ b, there is an s G (a, b) such that f'{s) = b^i ■ '^' 

Proof. Let h{t):=[f{b) — /(a)) ■ t — (b — a) ■ f{t). Then h is continuous on [a, 6], 
differentiable on (a, 6) and h{a) = f{b) ■ a — b ■ f{a) = h{b). If h is constant then 
h'{t) = for all t G {a,b). Otherwise, by Thm. 10.2.2, there is a maximum/minimum 
of h different from h{a) = h{b) in a t G {a,b). Hence h'{t) = by Prop. 10.3.15. This 
completes the proof since h'(t) = f{b) — f{a) — {b — a) ■ fit). |§ 

CONT-Rolle's Theorem . Let £ D OJ^. Assume CONT^ and AxOF. Let / : Q 4 Q 
be definable and £-differentiable function which is differentiable on (a, b) and continu- 
ous on (a, b). li a ^ b and /(a) = f{b), there is an s G (a, b) such that f'{s) = 0. ■ 

Corollary 10.3.16. Let C D 07. Assume CONT^ and AxOF. Let 7 : Q ^ Q" be 
an ^-definable and differentiable curve. Then for all distinct a, 6 G Q and for every 
(n — l)-dimensional subspace H that contains 7(a) —7(6), there is at least one c between 
a and b such that 7'(c) G H . 

Proof . The derivative vector of a curve 7 composed with a linear map A at t G Q is 
the A-image of 7'(t) by Cor. 10.3.14. Since any (n — l)-dimensional subspace of Q" can 
be taken to {0} x Q"^^ by a linear transformation, we can assume that H = {0} x Q""^. 
Recall that the function tt^ : Q" — )■ Q is defined as p i— )■ pt- Then 707rj(a) = '-foivt^b) since 
7(a)— 7(5) G H. By applying RoUe's Theorem to 707rj, we get that there is a c G Q such 
that (70 7ri)'(c) = 0. Thus 7'(c) is an element of iJ since (7 7rt)'(c) = 7rj(7'(c)) = 7'(c)t 
by Cor. 10.3.14. M 

Proposition 10.3.17. Let C D 07. Assume AxOF and CONT^. Let / : Q A Q be 
an ^-definable differentiable function (a, 6) C Dora f for some a, 6 G Q. If /'(t) = for 
all t G (a, 6) then there is a c G Q such that /(t) = c for all t G (a, 6). 

Proof . If there are x, y G (a, 6) such that f[x) 7^ f{y) and x 7^ y, then from CONT- 
Mean- Value Theorem there is a t between x and y such that f'(t)-{y—x) = f{y)—f{x) 7^ 
and this contradicts that f'{t) = 0. ^ 
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CONT-Darboux's Theorem . Let C D OT . Assume AxOF and CONT^. Let / : 
Q ^^ Q be an /^-definable and differentiable function such that (a, 6) C Dom f for 
some a, 6 G Q. If c G {f'{a), f'{b)^, there is an s G (a, b) such that f'{s) = c. ■ 

Proof . We can assume that /'(a) > d > f'{h). Let g{t):=f{t) — t ■ d. Then g is 
differentiable and g'{a) > 0, g'{b) < 0. Thus g cannot be maximal at a or 6 by 
Prop. 10.3.15. Thus, from Thm. 10.2.2, we get that there is a point, say c, between a 
and b where g is maximal. Thus from Prop. 10.3.15, we also get that g'{c) = f'{c) —d = 
0. ■ 

Let i < n. iTi : Q" — )■ Q denotes the i-th projection function, i.e., ttj : p i— )■ pi. 
Let f '■ Q, ^ Q". We denote the i-th coordinate function of / by fi, i.e., fi :=/ o m. 
Sometimes fr is used instead of /i. A function A : Q" — ?■ Q-' is said to be an afRne 
map if it is a linear map composed with a translation.^ 

The following proposition says that the derivative of a function / composed with 
an affine map A at x is the image of the derivative /'(x) taken by the linear part of A. 

Proposition 10.3.18. Assume AxOF. Let / : Q — ;■ Q" be differentiable at x and let 
A : Q" — )■ Q-' be an affine map. Then f o A is differentiable at x and (/ o A)'(x) = 
A{f{x)) - A{o). In particular, f'{x) = (/((x), . . . , /^(x)), i.e., //(x) = f (x).. 

On the proof The statement follows straightforwardly from the respective definitions. 



Proposition 10.3.19. Let C D OT. Assume AxOF and CONT^. Let f,g : Q^Q 
be ^-definable and differentiable functions on (a, 6). If /'(x) = g'{x) for all x G (a, 6), 
then there is a c G Q such that /(x) = ^'(x) + c for all x G (a, b). 

Proof . Assume that /'(x) = g'{x) for all x G {a,b). Let h:=f — g. Then h'{x) = 
f'{x) — g'{x) = for all x G {a,b) by (ii) and (iii) of Prop. 10.3.12. If there are 
y, z E (a, b) such that h{y) ^ h{z) and y ^ z, then by the CONT-Mean- Value Theorem, 
there is an x between y and 2; such that h'{x) = ^^_ ^^' 7^ and this contradicts 
h'{x) = 0. Thus h{y) = h{z) for all y,z E {(i,b). Hence there is a c G Q such that 
h{x) = c for all x E {a,b). ■ 



^That is, A is an affine map if tliere are L : Q" — > Q-* and a E Q^ such tliat A{p) = L{p) + a, 
L{p + q) = L{p) + L{q) and L{X ■ p) ^ X ■ L{p) for all p,qe Q" and A G Q. 
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10.4 Tools used for proving the Twin Paradox 

In this section we develop the tools which were used in Chap. 7. To do so, let us first 
introduce a notation. We say that p G Q*^ is vertical iff jv = o. 

Lemma 10.4.1. Let / : Q — )■ Q'^ be a well-parametrized timelike curve. Then the 
following hold: 

(i) Let X G Domf. Then /^ is differentiable at x and 1 < |/^(x)|. Furthermore, 
|/;(a;)| = liff /'(x) is vertical. 

(ii) Assume CONT and let / be definable. Then fr is increasing or decreasing. More- 
over, 1 < /^(x) for all X G Domf if /^ is increasing. 

Proof . As / is a well-parametrized curve, f'{x) is of Minkowski length 1. By Prop. 10.3.13, 
/t- is differentiable at x and f!^{x) = f'{x)r- Now, Item (i) follows from the fact that 
the absolute value of the time component of a vector of Minkowski length 1 is always 
at least 1 and it is 1 iff the vector is vertical. 

Let us now prove Item (ii). From Item (i), we have /^(x) ^ for all x G Domf. 
Thus by CONT-RoUe's Theorem, /,- is injective. Consequently, by CONT-Bolzano's 
Theorem, f^- is increasing or decreasing since /,- is continuous and injective. Let us 
now assume that fr is increasing. Then < /^(x) for all x G Domf by our definition 
of the derivative. Hence by Item (i), 1 < /^(x) for all x G Dom f\ and that is what we 
wanted to prove. ■ 

Theorem 10.4.2. Assume CONT. Let / : Q — )■ Q'^ be a definable well-parametrized 
timelike curve, and let a,b & Dom, f such that a < b. Then the following hold: 

(i) 6-a<|/.(6)-/.(a)|,and 

(ii) b-a< \fr{b) - fr{a)\ if /(x)^ 7^ f{a)„ for any x G [a,b]. 

Proof . For every i < d,we have that /j is definable and differentiable by Prop. 10.3.13. 
Hence by the CONT-Mean- Value Theorem, there is an s G {a,b) such that /^(s) = 



Mb)-Uia) 



b—a 



By Item (i) of Lem. 10.4.1, we have 1 < |/;(s)|. Then b-a < |/^(5)-/^(a) 



That completes the proof of Item (i). 

To prove Item (ii), let x G [a,b] such that f{x)cr 7^ f{o)cr- Then there is an i > 1 such 
that fi{x) 7^ fi{o)- Hence by the CO NT- Mean- Value Theorem, there is a y G (a, fe) 
such that /j'(y) = ^ ~ " 7^ 0. Thus f'{y) is not vertical. Therefore, by Item (i) of 
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Lem. 10.4.1, we have 1 < \f!^{y)\. Thus by our definition of the derivative, there is a 
z E {y, b) such that 1 < ' ^^^ _ ^^^^' . Hence we have 



z-y 



z~y< \fr{z) - f^{y)\. 

Let us note that a < y < z < b. By applying Item (i) to [a, y] and [z, b] we get 

y - a < \friy) - fria)] and b - z < \frib) - fr{z)\. 

fr is increasing or decreasing by Item (ii) of Lem. 10.4.1. Thus /r(a) < friy) < 
friz) < frib) or /r(a) > friy) > fr^z) > frip). Therefore, by adding up the last three 
inequalities, we get b — a < |/t(^) — /r(a.)|. ■ 

Theorem 10.4.3. Assume CONT. Let /, (? : Q — )■ Q'^ be definable well-parametrized 
timelike curves. Let a, b, a', 6' G Q such that 

• a < b and a' < b', 

• [a,b] C Doni f and [a',b'] C Domg, and 

• {fit):te[a,b]} = {git'):t'e[a',b']}. 

Then b — a = b' — a'. 

Proof . By (ii) of Lem. 10.4.1, fr is increasing or decreasing on [a,b] and so is Qr on 
[a',b']. Without losing generality, we can assume that Dam f = [a,b], Domg = [a',b'] 
and that /^ and Qr are increasing on [a,b] and [a', 6'], respectively.^ Then Ran{f) = 
Ran (g) by the assumptions of the theorem. Furthermore, / and g are injective since 
/t- and gr are such. Since Ran (/) = Ran (g) and g^ is injective, / o g~^ = fr ° Qt^- 
Let h:=f o g"^ = fr ° 9t^- Since Ran (/^) = Ran {gr) and fr and gr are increasing, 
h is an increasing bijection between [a, 6] and [a', 6']. Hence h{a) = a' and h{b) = b'. 
We prove that b — a = b' — a' hj proving that there is a c G Q such that h{x) = x + c 
for all X G [a,b]. We can assume that a ^ b and a' ^ b' . By Lem. 10.4.1, fr and gr 
are differentiable on [a,b] and [a', 6'], respectively, and /^(x) > for all x G [a, 6] and 
g'r{x') > for all x' G [a', b']. By Chain Rule and (v) of Prop. 10.3.12, h = frog'^ is also 
differentiable on (a, b). By h = f o g~^^ we have f = ho g. Thus /'(x) = h'{x)g'(^h{x)^ 
for all X G (a, 6) by Chain Rule. Since both f'{x) and g'(^h{x)) are of Minkowski length 
1 and their time-components are positive'^ for all x G (a, b), we conclude that h'{x) = 1 



^It can be assumed that /r is increasing on [a, b] because the assumptions of the theorem remain 
true when / and [a, b] are replaced by —Ido f and [—6, —a], respectively, and /t is decreasing on [a, 6] 
iff {—Ido f)r is increasing on [—6, —a]. 

^That is, /;(x) > and g;(/i(a;)) > 0. 
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or 



75 = /(7(2/))l 




— spacelike 

— timelike 
- lightlike 

Figure 10.1: Illustration for the proof of Prop. 10.4.4 



for all X G (a, b). By Prop. 10.3.19, we get that there is a c G Q such that h{x) = x + c 
for all X G (a, b) and thus for all x G [a, b] since h is an increasing bijection between 
[a, 6] and [a', 6']. ^ 

A curve is called slower than light (STL) and faster than light (FTL) iff any of its 
chords is timelike and spacelike, respectively. 

Proposition 10.4.4. Let C 3 OF. Assume AxOF and CONT^. Let 7 : Q A Q'^ be 
an £-definable and continuous curve. Then (i) and (ii) below hold: 

(i) 7 is timelike =^ 7 is STL. 

(ii) 7 is STL, FTL or it has a lightlike chord. 

Proof . To prove the first statement, let us assume that 7 is not STL. Then it has a 
lightlike or spacelike chord, say {p, ^}. Let H he a, {d — l)-dimensional subspace that 
contains p — q and does not contain timelike vectors. Thus by Cor. 10.3.16, we get that 
there is a t G Q such that 7'(t) is in H . Since H does not contain timelike vectors, 
7'(t) is not timelike. Thus 7 is not timelike. 

To prove the second statement, let us assume that 7 is not STL or FTL and does not 
have a lightlike chord. Then 7 has both timelike and spacelike chords. Then there are 
distinct points a,b,c & Ran (7) such that the triangle {a, b, c} determines two timelike 
and one spacelike or two spacelike and one timelike chords of 7. We can assume that 
7(0) = c and c is the intersection of the chords of same type. See Fig. 10.1. 
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For every p G Ran (7) by CONT, there is a closest t G Q to such that 7(t) = p, 
i.e., the set H\={ \x\ : 7(x) = p} has a minimal element.'' Thus there is a t G Q such 
that 7(t) is a or 6 and there is no t' between and t such that 7(t') is a or h. We can 
assume that 7(t) = a and t > 0. 

Let f ■ Q,^ \ {b} — 7- Q be the function defined as p t-)- -j^rjr- It is easy to see that 
/ is continuous and for all p G Q'^ \ {& } 



f{p) = 1/v 2 <^=^ p — b is lightlike, 

f{p) > I/V2 -^^ p- 6 is timelike, (10.1) 

f{p) < l/v2 <^==^ p — 6 is spacelike. 

Consider the function g:=^\. , o /. It is a continuous function. Furthermore, Dom g = 
[0,t] since there is no t' G [0,t] such that 7(t') = b. By (10.1) above and by the fact 
that 7(0) = c and 7(t) = a, we have that 



(^(0) > 1/V2 and^(t) < 1/V2) or (^(0) < 1/V2 and ^(t) > 1/V2) 

since one of the chords {b, c } and {b, a } is timelike and the other is spacelike. However, 
by CO NT-Bolzano's Theorem, there is a y G [0,t] such that g{y) = l/v2- Hence by 
(10.1) above, we have that 7(1/) — b is lightlike for this y. Consequently, {b,j{y)} is a 
lightlike chord of 7. This contradiction proves our proposition. ■ 

10.5 Tools used for simulating gravity by acceler- 
ated observers 

In this section we develop the tools which were used in Chap. 8. To do so, let us first 
introduce the following convenient notation. We say that a : Q — )■ Q is a nice map if 
it is differentiable such that ^ Ran a', and Dom a is connected. 

Lemma 10.5.1. Let a be a timelike curve. Then a,- is a nice map. 

Proof . Since a is a timelike curve, Dom a is connected and a'{x)r 7^ for all x G 
Dom a. But DomUr = Dom a and (ftr)' = (ctO-r- Thus ar is a nice map. ■ 

Lemma 10.5.2. Assume CONT. Let a be a definable nice map. Then a is injective. 
Moreover, a is monotonic. 



^That is so because of the following. Let s be the supremum of the nonempty bounded definable 
set { — |a;| : j{x) = p}. By the continuity of 7, one of 7(5) and "f{—s) must be p. Then — s is the 
minimal element of H. 
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Proof . If a were not injective, then a'{x) would be for some x by CONT-Rolle's 
Theorem. But a'{x) cannot be since a is a nice map. Thus a is injective. Then a is 
also monotonic by (2) in Lem. 10.2.4. Pl 

Lemma 10.5.3. Assume CONT. If a and 6 are nice maps, 6~^ and aoS are also nice 
maps. ■ 

Lemma 10.5.4. Assume CONT. Let a and 6 be definable timelike curves such that 
Ran a C Ran 6 (or Ran6 C Ran a), and let h:=a o S~^. Then h is a. nice map and 

\h'{x)\= ,, I. for all xeDomh. (10.2) 

fi{6'{h{x))) 

Proof . First we show that h = a^ o 5^^. Since a and 5 are definable timelike curves, 
a^ and 5r are definable nice maps by Lem. 10.5.1. Thus a^ and 5r are injective by 
Lem. 10.5.2. Consequently, a and 5 are also injective. Therefore, {x,y) G a,- o 5^^ iff 
aT-(x) = 5r{y) and (x, y) G a o 5~^ iff a{x) = S{y). Since a{x) = 5{y) — )■ ^^(a;) = 
5r{y) is clear, we have to show the converse implication only. By symmetry, we can 
assume that Ran a C Ran 6. Then there is a 2 G Dam 6 such that 6{z) = a{x), so 
Sr{z) = a^{x) = 5r{y). Thus z = y since 6 is injective, so a{x) = 5{y). That proves 
h = ar o S~^. 

By Lem. 10.5.3, h is a. nice map, so Domh is an interval. We have that a D /i o 5 
since h = aoS~^. Thus by Chain Rule, a'{x) = h'(x) ■5'(/i(x)) for all x G Dam h. Since 
fii^Xp) = |A|-/i(p) for all A G Qandp G Q'^, we have that /i(a'(x)) = |/i'(x)|-/i(5'(/i(x))) 
for all X G Domh. We have that /i(5'(/i(x))) 7^ since 5 is timelike. Hence equation 
(10.2) holds. m 

Lemma 10.5.5. Assume CONT. Let f3 and 7 be definable and well-parametrized 
timelike curves; let /3^, and 7* be definable timelike curves; let Xf^.yp G Doml3, x^,y^ G 
Dom 7 and x, y G Dotti /3* fl Dotti 7* such that 

(i) Ran /3^, C Ran P and Ran'-f^, C Ran'j. 

(ii) /3(x/3) = /3*(x), /3(y^) = /?*(?/), 7(x^) = 7*(x), 7(?/^) = 7*(y). 

(iii) X 7^ y and ;u(7;(z)) > iJ,{l3'^{z)) for all 2; G (x,?/). 

Then Ix^ ~ ^7! > |a^^ — Z/^l- 
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/3*,/3 



7*, 7 




Figure 10.2: Illustration for the proof of Lem. 10.5.5 



Proof . Since /3, /3*, 7 and 7* are definable timelike curves, they are injective by Lems. 
10.5.1 and 10.5.2. Thus X/3 7^ y/3 and x^ 7^ y^ since x ^ y. Let 

i:=(3o(3-^ and j:=7^o7-\ 

see Fig. 10.2. Then i, j and ioj are nice maps by Lem. 10.5.3 and 10.5.4. Furthermore, 

i{xp) = x, i{yp) = y^ j(x) = x^, j(y) = y^^ (zo j)(x/3) = x^ and {i o j){yp) = y^. 

Since X/3, yf^ G Dam i o j, and z o j is a nice map, we have that (x^, y^) C Dom i o j. 
Now we will show that 



^te{xp,yp) |(zojy(t)| >1. 

To prove this statement, let t G {xp.yji). Since 2 is a nice map, it is monotonic by 
Lem. 10.5.2, thus i{t) G {x,y). By Lem. 10.5.4 and the fact that f3 and 7 are well- 
parametrized, we have that 

/i(/3'(t)) _ 1 



N'W 



fiiP'Mt))) f^iP'Mt))) 



and 






From equations (10.3), (10.4) and Item (iii) by Chain Rule, we have that 
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;io.3) 



(10.4) 



This completes the proof of (10.5). 

By CONT-Mean- Value Theorem there is a 2; G {xp^yp) such that 



{lOj)\z) 



xp -yp xp- yp 



By this and (10.5), we conclude that |f^3^| > 1- Hence \x-y — y.y\ > |x/3 — y/s], as it 
was required. M 

Remark 10.5.6. Lem. 10.5.5 remains true even if we substitute "=" or ">" for ">". 
The proof can be achieved by the same substitution in the original proof. 

Proposition 10.5.7. 

(1) Let a be a well-parametrized timelike curve. If a is twice differentiable at t G 
Doma, then a'(t)±^a"(t). 

(2) Let d>3. Assume AccRelo. Let A; G Ob and m G 10b. Then v^(t)±^a^(t) for 
all t G Domai^. 

Proof . To prove Item (1), let t G Doma such that a is twice differentiable at t. Since 
a is a well-parametrized timelike curve, we have that 

{a[{t)Y-{a',{t)f-...-{a',{t)Y = l. (10.5) 

By derivation of both sides of equation (10.5) we have that 

2a; (t) ■ a'[{t) - 2a',{t) ■ a'i{t) - ... - 2a'^{t) ■ a'^{t) = 0. 

Thus a'{t)-Lfj_a"(t), which is what we wanted to prove. 

Item (2) is a consequence of Item (1) since v^ = (Ic^)', a^ = (Ic^)", Ic^ is a 
well-parametrized timelike curve by Thm. 6.1.11, and Ic^ is twice differentiable at t iff 
t G DomsL^. M 

If / : Q ^ Q, we abbreviate /(t) > for all t G Dom / to / > 0. We also use the 
analogous notation / < 0. 

Lemma 10.5.8. 

(1) Assume CONT. Let a be a definable and twice differentiable timelike curve such 
that Ran a C tx-Plane. If a" o // < 0, then a'^ is increasing or decreasing. 

(2) Let d> 3. Assume AccRel. Let A; G Ob and m G lOb such that vj\jn{k) C tx-Plane 
and Domai^ = Dom Ic^. If k is positively accelerated, (v^)2 is increasing or 
decreasing. 
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Proof . To prove Item (1), let t G Dam a. By Prop. 10.5.7, a"(t) is a spacelike vector 
since it is Minkowski orthogonal to a timelike one. Therefore, fi{a"{t)) < iff |q;ct(^)| 7^ 
0. Thus, since Ran a C te-Plane, we have that fi{a"{t)) < iff C(2{t) 7^ 0. Thus by 
CONT-Darboux's Theorem, a" o /i < iff 02 > or a'g < since aj is definable and 
Dam a' = Dam a is connected. Then by CONT- Mean- Value Theorem, 02 is increasing 
or decreasing. 

Item (2) is a consequence of Item (1) because of the following. Let a = lc„. Then 
by Thm. 6.1.11, a is definable (well-parametrized) timelike curve, a twice different iable 
since Dam a" = Domaf^ = Dom\c^ = Dam a; and Ran a C tx-Plane since by (5) in 
Prop. 6.1.6, wlm(A;) = Ran\c^. Then k is positively accelerated iff a" o ^ < 0. Hence 
by Item (1), if k is positively accelerated, (v^)2 = 0^2 is increasing or decreasing. ■ 

Let us introduce the following notation: 

dwi{p):=wi{p) - wi{o). 
Proposition 10.5.9. Let d > 3. Assume SpecRel. Let m,kE lOb and h G Ob. Then 

(1) p±f,q iff dwi{p)±^dw'^{q). 

(2) v^ = v^ o dw'^ and Dom'v^ = Doni\j^. 

(3) a^ = a^ o dw^ and Dam a^ = Dam a^. 

Proof . To prove Item (1), observe that n(^dw'^{p)) = lJi{p) by Thm. 3.2.2. The 
statement P-i-^q iff fi{p + g)^ = fi{p)'^ + n{qY can be proved by straightforward 
calculation. Thus Item (1) is clear since dw^ is linear by Thm. 3.2.2. 

To prove Items (2) and (3), let us note that lc„ and Ic^ are functions by Item (2) 



k 



in Prop. 6.1.6. Thus v^ = v^ o dw^^ follows by Chain Rule because Ic^ = Ic^ o w 
(by (3) in Prop. 6.1.6), the derivative of w^ is dw^ (since w^ is affine transformation 
by Thm. 3.2.2), and v^ = (Ic^)' (by definition). Hence Domi^l^ = Domini} also holds 
since dw^ is a bijection. a^ = a^ o dw^ follows from (2) of this proposition by Chain 
Rule because the derivative of dw^ is dw^ (since dw^ is a linear transformation), 
and a^ = (v^)' (by definition). Hence Doma.^ = Domaj! also holds since dw^ is a 
bijection. Mi 

The light cone of p G Q"^ is defined as Ap-:={g G Q'^ : p\q}- The past light 
cone of p G Q'^ is defined as AZ:={q G Q"^ : p Xq A qr ^ Pt}- The future 
hght cone of p G Q'' is defined as At:={q G Q'^ : p Xq A qr > Pt}- We say 
that p E of" chronologically precedes q G Q"', in symbols p <^q, iff p r g and 
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Pr < Qt- The chronological past of p G Q*^ is defined as 1 0'-= {q ^ Q^ '■ q ^ p }■ 
The chronological future of p G Q*^ is defined as /jt:={^€Q'^ : p^g}. The 
chronological interval between p G Q'^ and g G Q'^ is defined as {p, q} :={f & Q'^ : 
prf A qrr A Vr E {pT,qT)}- We also use the notation 1 0:=IZ U / 1 U {p}. 

Lemma 10.5.10. Let p , g G Q'^. Then 

(1) Ifprg, thenA. nA. =AtnA+ = 0. 

(2) If p < g , then A. n /. = 0, and Az U Iz c I^ . 

(3) p«g-iff/+n/^::7^0. M 

Lemma 10.5.11. Assume CONT. Let 7 be a definable timelike curve, and let x,?/ G 
Doni'-f such that x ^ y. Then 

(1) All the chords of 7 are timelike, i.e., 7(0;) T'j{y). 

(2) If 7(x) G /pT and 7(2/) ^ J-T, there is a 2; G [x, y] such that 7(z) G AZ. 

(3) If 7(x) G /i" and 7(2/) ^ Jt, there is a 2; G [x, y] such that 7(z) G At. 

(4) If 7^ is increasing (decreasing), 7(x) <^ 7(1/) iff x < y {y < x). 

(5) 2 G (x,?/) iff 7(2) G ((7(x),7(y))). 

Proof . Item (1) follows from Prop. 10.4.4. To prove Item (2), let 

H:={te [x,y] ■ /i(7(t),p) <0 A 7(t), < p, } . 

It is clear that H C Doni'-f is definable, bounded and nonempty. Let z:=supH which 
exists by CONT. Thus by continuity of 1 1-)- /i(7(t),p) and 7t-, we have that 7(2;) ^ /^ 
since z is an upper bound of H. Furthermore, fi{'~f{t),p) < and 7(^)1- < Pr since z 
is the least upper bound of H. But 7(t)r = Pr and fi{'y{t),p) < is impossible. Thus 
l{t)T < Pt and fi{'y{t),p) = 0. Hence 7(p) G A^. 

Item (3) is clear from Item (2) since the continuous bijection p \-^ —p takes It to 
/landA+toAz. 

Item (4) is clear by Item (1). 

Item (5) is a consequence of Item (4) since 7,- is either increasing or decreasing by 
Lems. 10.5.2 and 10.5.3. M 
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We use the following notations: 

Conee{'P]q):= M line{p ,f) and A^[if]:= M A-;. 

Let a and /3 be timelike curves. We say that /S^, is the photon reparametrization 
of (3 according to a if 

/3* = {{t,p) G Doma x Ran 13 : p G A~,^.}. 

Proposition 10.5.12. Assume CONT. Let a and (3 be definable timelike curves. Let 
/3^ be the photon reparametrization of (3 according to a. 

(1) Then /3* is a definable, continuous and injective curve. 

(2) If Ran a fl Ran (3 = 0, and Ran a U Ran (3 is in a vertical plane, /3* is a timelike 
curve, and /3,(to) + /3Uto) e A" ^^)^^,(^^). 



/3'(io) 
/3:(to) ._ 




I 1 ,-ui\^ 




a(t)-a(to) 
t-to 


-^"A IJ^-^K V 


',«(*) 




V 


ml- 


("(io) 




Mt) 


"-^i 

1 


'/3(to) = /5*(to) 







Figure 10.3: Illustration for the proof of Prop. 10.5.12 

Proof . It is clear that /3* is definable. 

To show that /3* is a function, we need to prove that A".^-, fl Ran /3 has one element 
at the most for all t G Doma. It is clear by Lem. 10.5.11 since if it had two distinct 
elements, say p and q, then {p, q} would not be a timelike chord of /3, but a lightlike 
one. 
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For all t G Domfi^,, let t G Dom P such that /3(t) = fi^{t), and let / : t t-> t be the 
reparametrization map, i.e., /:=/?* o /3~^. First we show that 

te{tiM) o «(t) G ((«(ti),«(t2))) ^4 /3.(t) G ((/3*(ti),/3*(t2))) ^te(ti,t2) 

if t,ti,t2 G DomP^. The first and the last equivalence are clear by (5) in Lem. 10.5.11 
since a, /3 are timelike curves and /3(t ) = /3*(t) for all t G Dom fi^. To prove (*), we can 
assume that a{ti) <^ a{t) <^ a{t2)- Thus /3*(t) <^ a(t2) since A~.^v C I~,^ x (2) of by 
Lem. 10.5.10. Therefore, /3*(t) < /3*(t2) since /3*(t) G Ip^^t^) by (1) in Lem. 10.5.11, but 
^t*{t2)^'^a{t2) ~ ^ ('^) ^y Lem. 10.5.10. A similar argument can show that /3*(ti) ^ /3*(t), 
so (*) is proved. Now we have that / preserves betweenness, so it is monotonic. 

To show that Dom 13^ is connected, let x, y G DomP^,, and let z G {x,y). Then 
z G Dom a since x, y G -Dorra a and Dotti a is connected. Since a is a timelike curve, 
a{z) G {a{x),a{y)}. Without losing generality, we can assume that a{x) <^ a{z) <ti 
a{y). Then /3*(x) G /^(^^ since /3*(a;) G A;;;^^.) C J^^^-,; and /3*(y) ^ /^^^^^ since l3^{y) G 
A;;^^) and A;;^^) fl J^^^-, = 0, see Lem. 10.5.10. Then by (2) in Lem. 10.5.11, there 
is a ^ G Dom P such that f3{z) G A~. s since f3{x) G /~/^n and /3{y) ^ ^alz)- Thus 
{z^Piz)) G /3*. Consequently, 2 G Dom (3^. Hence Dom, (3^ is connected. 

Now using a similar argument, we show that Ran f C Dom, 13 is also connected. 
To do so, let X, y G Ran f and S" G {x,y). Then £ G Dom 13. We can assume that 
/3(x) <^ f3{z) ^ f3{{y)- Then a(x) G Jt.j.^ and a;(i/) ^ ^tiz)- Thus there is a 2; G Dom a 
such that 0(2) G At/^y Consequently, /3{z) G A^, ^ so {z,/3{'z)) G /9*. Therefore, 
2 G -Ran /, and hence Ran f is connected. 

Since Ran f is connected and / is monotonic, / must be continuous by Lem. 10.2.3. 
Hence /3* = / o /3 is also continuous and /3* injective since both /3 and / are such. So 
Item (1) is proved. 

To prove Item (2), let q = a' (to) + tt(to), r = /3'(to) + /3(to), and let p be the unique 

element of A-I fl line{f3(tQ),r), see Fig. 10.3. We will show that /3^to) = p — /3*(to)- 

To do so, let e G Q"^ be fixed. We have to show that there is a 5 G Q''" such that 

' ;~f ° G BJp) if t G Dom /3* fl BsUq). It is clear that we can choose Ei and £2 such 

that 

A-[5,,(g-')] n Cone,,(/3,(to);r) C 5,(p). (10.6) 

Since a is differentiable at to) there is a (5i G Q"*^ such that 

°W-f°' +.fa)gB,.m (10.7) 

r — to 

if t G Do?Ti a n i?5^ (to)- Since Ran (3 fl i?a?T. a = 0, and Ran [3 U i?a?T. a is in a vertical 
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plane, line(^f3^:(t),a(t)) and Zme(/3*(to)7 «(^o)) are parallel. Hence 



Thus by (10.7), we have that 



+ f3.{to)eA-[B,,{q)] (10.8) 



t-to 
if t G Dom /3^ n Bs^ito). Since (3 is differentiable at to, there is a ^2 G Q"*" such that 

^-^^M^ + m ^ B,.Ar) (10.9) 

t — to 

if t G Dom 13 n Bg^lio). Since / : t H- t is continuous, there is a ^2 G Q^ such that 
(10.9) holds if t G Dom 13^ n ^^.(to). Since 

(3,{t) - /J,(to) _ f3{i) - /3(to) t-to 

t — to t — to t — to 

we have that 

^*^^l~ ^*^^'K Mto) G Cone,,(/J.(to);f) (10.10) 

t — to 

if t G Dom l3^.P\Bs^{tQ). Let 5 = min(5i,52). Therefore, by equations (10.8) and (10.10), 

we have that 

^*^^!~f^^°^ + nto) e A-[B,A<i)] n Cone^Mito); n 

t — to 

if t G Dom, 13^ n Bsito). But the latter is a subset of B^{p) by equation (10.6). Conse- 
quently, 

t — to 
if t G Dom. f3^ n Bsito). Hence /3* is differentiable at to and /3^(to) = p — /3*(to), as it 
was required. J(L, 

The following example shows that the assumption Ran a fl Ran /3 = is necessary 
in item 2 in Prop. 10.5.12. 

Example 10.5.13. Let A G Q such that 1 < A or A < —1. Let timelike curves a and 
/3 be defined as a(t) = (t, 0, . . . , 0) and f3(t) = (A ■ t, t, 0, . . . , 0) for all t G Q. Then the 
photon reparametrization of (3 according to a is: 

I (a^-^>a^-^.0,...,0) t<0, 
see Fig. 10.4. Therefore, /3* is continuous, but it is not differentiable at t = 0. 
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\A+1 ' A+1 ''i^' 

y (A,i',o,...,o) 




,0) 



(-A,-1,0,...,0) 



Figure 10.4: Illustration of Example 10.5.13 
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\ 2 ' 2 ''^Z 



Figure 10.5: Illustration of the photon sum p A q, and for the proof of Lem. 10.5.14 
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Let p,q & tx-Plane. Then the photon sum of p and q, in symbols p Aq, is 
the intersection of the two photon hnes {p + {A, A,0, . . . ,0) : A G Q} and {q + 
(fi,-fi,0,...,0):fiGQ}. 

Lemma 10.5.14. Let p , g G te-Plane, and let a = ^^ and b = ^^^- Then 
p Aq = {a + b,a — b,0, . . . ,0). 

Proof . The proof is straightforward by the respective definitions, see Fig. 10.5. ■ 



Lemma 10.5.15. Assume CONT. Let /3 be a definable timelike curve. Then /3 ^ : 
Ran 13 — > Domfi is definable, injective and continuous. 

Proof . It is clear that /3~^ is definable and injective. 

Since by Lems. 10.5.1 and 10.5.2 /3 is injective, /3~^ is a function from Ran (3 to 
Dam p. To prove that it is also continuous, let to ^ Domp. We have to show that 
for all e G Q^, there is a 5 G Q"*" such that if t G Domfi and |/3(t) — /3(to)| < 5, then 
|t-to| <e. By Lem. 10.5.11, t G {tQ-e,tQ + e) iff /3(t) G ((/3(to - e), /3(to + £)))• Thus, 
since ((/3(to ~ ^)) /3(to + £^))) is an open set, there is a good 5. ,J/k. 

Lemma 10.5.16. Assume CONT. Let /3 be a definable timelike curve and /3* a 
definable continuous curve such that Ran 13^ C Ran P, and let /:=/?* o /3~^. 

(1) Then / is a definable and continuous function. 

(2) If /3* is injective, / is also injective. Moreover, Dam f and Ran f are connected 
and /"^ is also a definable, monotonic and continuous function. 

(3) If /3* is differentiable such that f3i(t) ^ o for all t G Dam P, then / is injective 
and differentiable, and f'{t) ^ 0. Hence f~^ is also a differentiable function. 

Proof . Item (1) is clear by Lem. 10.5.15. 

Item (2) is clear by Item (1) and Lem. 10.2.4 since Dam f = Dam P^, which is 
connected. 

To prove Item (3), let to G Dam f . Since Ran P^ C Ran (3, we have that there is 
a A G Q such that A ■ /3'(to) = /3U/(to)). Since (/(t) - /(to))/(t - to) is the ratio of 

parallel vectors 

/3(t)-/3(to) /3.(/(t))-/3.(/(to)) 

t-to fit) -f (to) ' 

we have that (/(t) - /(to))/(t - to) tends to /3'(to)//3U/(^o)) = 1/A if t tends to to. 
Thus / is differentiable, and /'(to) = 1/A. ,l|.. 
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Lemma 10.5.17. Assume CONT. Let a be a definable timelike curve. Let t G Doma 
and X = ttrit). Let fa:=a~^ o a^ 



t(j. 



(1) Then fa is a differentiable curve, and /^(x) = a'^{t) / a'^(t) . 

(2) If a is twice differentiable at t, then so is /« at x, and 



/:(^) 



<{ty 



Proof . Let us first prove Item (1). We have that a,- is injective by Lems. 10.5.1 and 
10.5.2. Hence fa is a function. Dam fa is connected since Dam fa = Ranar and 
Ran Or is connected by Lem. 10.2.4. Thus fa is a curve. Since Or is an injective 
differentiable curve, a~^ is also such and {a~^y{x) = l/a'^{t). Thus by Chain Rule, we 
have that f^{x) = a'^{t) / a'^{t) . 

Now let us prove Item (2). If a is twice differentiable at t, then so are a^ and a,-. By 
Item (1), /^ = a~^ o a'^/a'^. Thus /„ is twice differentiable at x and a straightforward 
calculation based on the rules of differential calculus can show that /^(x) is what was 
stated. 



Lemma 10.5.18. Assume CONT. Let a and /3 be definable timelike curves such 
that Ran a U Ran 13 is in a vertical plane. Let ti,t2 G Doma and ti,t2 G Dom (3 such 
that a(ti)cr/3(ti), 0(^2) o-/3(t2) and (/3(ti) - a(ti)) tt ("(^2) -/3(^2))- Then there is a 
t G (^1,^2) such that a;(t) G Ranfi. Hence Rana fl Ran P 7^ 0. 

Proof . Since Rana U Ran 13 is in a vertical plane, we can assume, without losing 
generality, that d = 2. By CONT-Bolzano's Theorem, we can also assume that a(ti)r = 
/3{ii)T and a(t2)r = /3(^2)r- Let Xi = a(ti)^ and X2 = a(t2)r- Let fa'-=(^r^ o tto- 
and fj3:=f3~^ o /3^. Then /q, and /^ are continuous curves, see Lem. 10.5.17. By the 
assumption [f3(ti) — a(ti)) H («(t2) — f^ih)), "we have that (//^(xi) — /^(xi)) (/a(x2) — 
fi3{x2)) < 0. Thus by CONT-Bolzano's Theorem, there is an x G (xi,X2) such that 
/a(x) = fi3{x). Let t:=a~"'^(x). Then a(t) G Ran (3. .B 

Let a and (3 be timelike curves. We say that /3* is the radar reparametrization 
of /3 according to a if 

/3, = {{t,p) G Doma X i?an/3 : 3r G Q p G A'^^^^ n A+^_^)}. 

We say that /3 is at constant radar distance r from a iff 

RanB C I I A" _^ , fl A+ ,. 

f^ — V^_/ a{t+r) a(t—r) 

Hr&Dom a 
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Let us note that this r can be negative if a^ is decreasing since by this definition 
a{t-r) < a{t + r). 

Proposition 10.5.19. Assume CONT. Let a and /3 be definable timehke curves. Let 
/3^, be the radar reparanietrization of /3 according to a. 

(1) Then f3^: is a definable, injective, and continuous curve. 

(2) If Ran a U Ran /3 is in a vertical plane, and /3 is at constant radar distance r from 
a, then /3* is differentiable. 

(3) Let us further assume that this vertical plane is the tx-P\ane. Then 

^i{t) = a'{t-r)Aa'{t + r) iff (/3,(t) - a(t)) tt U, 
Pi{t) = a\t + r)Aa'{t-r) iff (/3,(t) - a(t)) tt-t- 

Proof . It is clear that /3* is definable. Without losing generality, we can assume that 
ar is increasing, see Lems. 10.5.1 and 10.5.2. 



Domg 



Dam f \-t 



Ranh< --h{t) 




a \ 




-t 



Domh 



Rang 



Figure 10.6: Illustration for the proof of Prop. 10.5.19 



To show that /3* is a function, let (t, p ), (t, q) G /3*. Then there are r, s G Q such that 

v G A"., , ^ n A"*",, s and q G A",, , ^ fl A"^., s. We can assume that < r < s. Since 

^ a(t+r) a{t~r) ^ a(t+s) a(t~s) — — 

both a and /3 are timelike curves, p = g iff r = s. Therefore, if p 7^ q., a{t+r) <^ a{t + s) 
and a{t - s) < a{t - r). Thus g ^ J^" since /r C /^(^^^^ and /"(^^^^ n A^^^^^^ = 0; 
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and q^l^ since it C lX{t-r) ^'^^ ^l{t~r) ^ K{t-s) = ^- Thus p = q since «f e Jp- by 
Lem. 10.5.11. 

For all t G Dom fi^, let t G Domfi such that /3(t) = P*{t), and let / : t t-> t 
be the (radar) reparametrization map, i.e., /:=/?* o /3^i. Then / is injective since 
if A"., , . n A+,, ^ n A",, , ^ n A+., ^ ^ 0, then ti = tg and r = s, see (1) in 

ayti+r) a{ti—r) a(t2+s) 0(12— s) / '^' 1 z 1 \ / 

Lem. 10.5.10. Let g and h be the photon reparametrization maps of (3 according to 
a and of a according to /3, respectively. Then g, g^^ and h, h~^ are monotonic and 
continuous bijections between connected sets, see Prop. 10.5.12 and Lem. 10.2.4. It is 
clear by the respective definitions, that 

for all t G Ranf, see Fig. 10.6. Thus /^^ is continuous since both h and g~^ are such. 
It is clear that Dom f~^ = Ranf = Dorah fl Rang. Thus Dom f~^ is connected 
since both Domh and Rang are such. Therefore, Dom 13^ = Dom f = Ran f~^ is also 
connected and / is definable and continuous, see Lem. 10.2.4. Hence /3* = / o /3 is also 
continuous; and /3* is injective since both /3 and / are such. So Item (1) is proved. 

Now let us prove Item (2). If r = 0, then /3^, is the restriction of a to Dom (5^ which 
is connected, thus it is obviously different iable. If r 7^ 0, then Ran ad Ran P = 0. Thus 
by (2) in Prop. 10.5.12 and Lem. 10.5.16, we have that h and g~^ are differentiable. 
Thus / is also differentiable. 

To prove Item (3), let Ran a U Ran (5 C tx-Plane. By Item (2) of this proposition, 
(3^, is differentiable. It is not difficult to see that 

/3Jt) =a(t-r) Aa(t + r) iff (l3Jt) - a(t)) tfl^ and 

^ (10.11) 

/3,(t) = a{t + r) X a{t - r) iff (/3,(t) - a{t)) ^-1, 

if t G Dom P^: since /3 is at constant radar distance r from a. By Lem. 10.5.18, we have 
that the direction of /3*(t) — a{t) cannot change. Thus it is always the same equation 
in (10.11) that holds for /3*. Hence Item (3) follows from Lem. 10.5.14 by an easy 
calculation. JP:. 

If a : Q A Q'^ and p G Q'^, we abbreviate a{t) f^p for all t G Dom a to a f\'p. We 
use analogously the notation a H /3 if a, /3 : Q — )• Q'^. Let a :={a2, «i, «3, • • • , ctd) for 
all a : Q — 7- Q'', i.e., the first two coordinates are interchanged. 

Lemma 10.5.20. Assume CONT. Let a be a definable timelike curve. 

(1) Thena'ttltor a'tt-lt- 
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(2) If Rana C tx-Plane, then a' tfl^ iff a' ttU and a' tt-l^ iff a' tt-lt- 

(3) If a is twice differentiable, Rana C tx-Plane and o ^ Rana", then a" tt Ix 
(a" tt — Ij:) iff ^2 is increasing (decreasing). 

(4) If a is twice differentiable, Rana is in a vertical plane and o ^ Rana", then 
a"(ti) tta"(^2) for all ti,t2 G Doma. 

(5) If a is twice differentiable and Rana C te-Plane, then for all t G Doma, there is 
a At G Q such that Xta'{t) = a"{t). Furthermore, if o ^ Rana", the sign of Xt is 
the same for all t G Dom a and 

Ai>0 iff a'tta" 

(10.12) 
Ai<0 iff -a'tttt" 

Proof . Item (1) is easy to prove since by Lem. 10.5.1, ^ Ranar- Thus by CONT- 
Darboux's Theorem, we have that a!^ > or a:^ < 0. 

To prove Item (2), let us first note that a = {ar,a2,0, . . . ,0) since Rana C 
tx-Plane. Therefore, a' = {a'^, a!^, 0, . . . , 0). Hence a' tt Ix iff "r > 0' ^'^^ ^' tt-lx iff 

a;<o. 

To prove Item (3), let t G Doma. It is clear that a"{t) is spacelike or o since 
a"(t)_L^a'(t) by Prop. 10.5.7. Thus o ^ Rana" iff o ^ Rana". We have that a^j = 
{a2, 0, . . . , 0) G Q'^~^ since Rana C tx-Plane. Thus o ^ Rana" iff ^ Ranai^. Hence 
^ Rana2. Therefore, by CONT-Darboux's Theorem, we have that a'g > or a2 < 0. 
Consequently, a" ttl^ iff "2 > O5 and a" tt-l^ iff "2 < 0- Thus, since ^ Rana2, 
a" tflx iff 0^2 is increasing, and a" tt~la: iff 0^2 is decreasing. 

Let us now prove Item (4). Without losing generality, we can assume that the ver- 
tical plane is the tx-Plane. By Lem. 10.5.8, we have that 02 is increasing or decreasing 
since a" o /i < iff o ^ Rana". Thus Item (4) follows by Item (3). 

Let us finally prove Item (5). Since both a'{t) and a"{t) are Minkowski orthogonal 
to a'{t) and are in the tx-Plane, there is a A^ G Q such that a'{t) = Xta"{t). By Items 
(2) and (3), equation (10.12) is clear. HI' 

Let a and /3 be timelike curves. We say that /3* is the Minkowski reparametriza- 
tion of /3 according to a if 

/3^ = {{t,p) e Dom,a X Ran/3 : {p — a{t))±^a'{t)}. 

We say that (3 is at constant Minkowski distance r G Q"*" from a iff for all 

p G Ran (3, there is a t G Doma such that —ji{p,a{t)) = r. 
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Proposition 10.5.21. Assume CONT. Let a and /3 be definable timelike curves 
such that a is well-parametrized, and let /3* be the Minkowski reparametrization of /3 
according to a such that. 

(i) a is twice differentiable, and o ^ Rana" . 

(ii) Rana U Ran (3 is in a vertical plane. 

(iii) If (t,p) G /3* and (a(t) - p) tt«"(^), then -fi[p,a{t)) < -l//i(a"(r)) for all 
r G Donia. 

(iv) /3 is at constant Minkowski distance r G Q^ from a. 

Then /3^, is a definable timelike curve. Furthermore, 

P'^{t) = a'{t)+r- a"{t) iff a"{t) tt (/3*(t) - a(t)). 



/3:;(t) = a\t)-r- a'\t) iff a"(t) tt («W - /3*(i)) 
if i?ana U Ran [5 C ta;-Plane, a' ttlt and a" tflx- 



;i0.13) 



Proof . It is clear that (5^ is definable. 

To see that /3* is a function, let {t,q), {t,'p) G /3*. Then (p — g)±^a'(t). li p j^ q, 
they are timelike-separated by Lem. 10.5.11 since p,q & Ranfi. Thus, since two 
timelike vectors cannot be Minkowski orthogonal, we have that p = q ■ Hence P^, is a 
function. 

Without losing generality, we can assume that the vertical plane that contains 
Rana U Ranfi is the te-Plane, a' ttl* and a" V\lx-, see Lems. 10.5.8 and 10.5.20. 

Since (3 is at constant Minkowski distance r from a, 

/3,(t) = a{t) + r ■ a'{t) iff a\t) tt (/3*(t) - a(t)), 
/3,(t) = a{t) - r ■ a\t) iff a\t) tt («(t) - (3,{t)) 

if t G Dam P^. 

Since /3 is at constant Minkowski distance r G Q^ from a, we have that Rana fl 
Ran P = 0. Hence by Lem. 10.5.18, we have that the direction of /3*(t) — a{t) cannot 
change. Thus it is always the same equation in (10.14) that holds for /3^. 

Since a is twice differentiable, so is a. Thus both a + r-a' and a — r-a' are definable 
differentiable curves. 

Now we will show that a -|- r ■ a' is a timelike curve and if a'(t) tt {(^(t) ~ f^*(t)) for 
some t G Doni 13^,, then a — r ■ a' is also a timelike curve. It is clear that (a ± r ■ a')' = 
a' ±r-a". Let t G Dam a. By (5) in Lem. 10.5.20, we have that ji{a'{t) +r-a"{t)) = 
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fi{a'{t)) +r/i(a"(t)) and /i(a'(t) - r ■ a"(t)) = fi{a'{t)) -rfi{a"{t)). By Thin. 6.1.11, 
we have that /i(a'(t)) = 1. Thus /i((a + r ■ a')'(t)) > 0. Hence a + r ■ a' is a timehke 
curve. Since a' ftU and a" tt^x, we have that a"{t) tta'(^) by Lem. 10.5.20. Thus by 
assumption (iii) and the fact that /3 is at constant Minkowski distance r from a, we have 
that r < -l/yu(a"(r)) for all r G Doma if a'(t) tt ("(^) - /5*(^)) for some t G Doma. 
Since Ran a C tx-Plane, we have that fi{a"{t)) = —fi{a"{t)). Thus fi{a"{t)) < 1/r. 
Consequently, ji{a'{t) — r ■ a" {t)) > 0. Hence a — r ■ a' is also a timelike curve. 

Here we only prove that Dom (3^ is connected when a'{t) 'W (a(t) — /3*(t)) for some 
t G Dom /3* because the proof in the other case is almost the same. Let ti, ^2 ^ Dom /3*, 
and let t G (^1,^2)- Then ti, ^2 ^ -Dotti a, and thus t G Dotti a since Dom a is connected. 
Since a — r ■ a' is a timelike curve and cJ — r ■ a" X\\ti we have that 

/3,(ti) = a(ti) - r ■ a'(ti) < a(t) - r ■ a'(t) < 0(^2) - r ■ a'{t2) = /3*(t2)- 

Thus by CONT-Bolzano's Theorem, there is a t G Dom (3 such that (/3(t)— a(t))±^a'(t). 
Since (3 is at constant Minkowski distance r from a, we have that (3{t) = a{t)—r-a'{t). 
Hence t G Dom, (3^, as it was required. 

Since /3* agrees with one of the two timelike curves a + r ■ a' and a — r ■ a' on the 
connected set Dom (i^,, we have that /3^, is also a timelike curve. Since a" tt la; and 
a ttli we have that a" X\a' . Therefore, by derivation of the equations of (10.14), we 
have that the derivative of /3* is what was stated in (10.13). HI', 
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Chapter 11 

Why do we insist on using FOL for 
foundation? 



In this chapter we are going to give a detailed explanation why FOL is the best logic 
to be used in foundational works, such as this one. 

11.1 On the purposes of foundation 

The main purpose of foundation is to get a deeper understanding of fundamental con- 
cepts of a theory by stating axioms about them and studying the relationship between 
the axioms and their consequences. There are three main kinds of question to ask in 
the course of foundation: 

• What are the consequences of the given axioms? 

• What axioms are responsible for a certain theorem? 

• How do statements independent from the theory relate to one another? 

The first one is a usual question of ordinary axiomatic mathematics. The other two 
are new kinds of question in foundational thinking and reverse mathematics. The third 
one is meaningful only in the case of incomplete theories; but there are a lot of incom- 
plete theories, e.g., any consistent axiom system containing arithmetic is incomplete by 
Godel's first incompleteness theorem. Moreover, it is usually reasonable to weaken a 
complete theory to make it possible to ask this third type of question. For example, to 
facilitate studying the role of the axiom of parallels, Euclid's complete axiom system 
of geometry was weakened to an incomplete one. These three kinds of question are 
studied in the hope that they will lead to a more refined and deeper understanding of 
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the fundamental concepts and assumptions of the given theory. For more details on 
the role and importance of foundational thinking, see, e.g., [2, Introduction] and [24]. 

11.2 The success story of foundation in mathemat- 
ics 

Experience shows that foundational thinking does lead to deeper understanding. For 
example, in geometry it clarified the status of the axiom of parallels and led to the 
discovery of hyperbolic geometry. It has been shown by foundation that this axiom is 
independent from the other basic assumptions of Euclidean geometry. 

Foundation also eliminated Russel's antinomy from set theory and thus from math- 
ematics. It helped to gain a deeper understanding of many statements of set theory by 
providing many other statements which are weaker, equivalent or stronger according 
to some axiom system of set theory, such as the Zermelo-Fraenkel set theory (ZF). For 
example, the axiom of choice is equivalent to Zermelo's well-ordering theorem, Zorn's 
lemma and the existence of basis in every vector space; the Baire Category Theorem, 
Stone's representation Theorem and the Banach-Tarski paradox are some of its many 
consequences; and the statement "every subset of real numbers M is Lebesgue measur- 
able" is stronger than the negation of the axiom of choice. These results tell us more 
about what it means to postulate the axiom of choice or its negation. And there are 
lots of other statements of set theory which are investigated in this way. 

Second-order arithmetic^ is also a good example of the successfulness of founda- 
tional thinking. The main goal of second-order arithmetic is to investigate how strong 
a set existence axiom is needed to prove certain theorems of mathematics, such as the 
Bolzano- Weierstrass Theorem or Konig's Lemma. For more details, see, e.g., Simp- 
son [66]. 

The examples above show that foundational thinking has been fruitful in many fields 
of mathematics. Hence it seems to be a good idea to apply it in a wider range; for 
example, in certain fields of physics, such as relativity theory as suggested by Harvey 
Friedman [25], for instance. 

There are lots of interesting assumptions, statements and questions of relativity 
theory (both special and general), such as the possibility/impossibility of faster than 
light motion, the twin paradox, gravitational time dilation or the existence of closed 



^Let us note that the name second-order arithmetic is misleading since it is a two-sorted FOL 
theory, i.e., it is a FOL theory which studies two kinds of individuah sets and numbers. 
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timelike curves (i.e., the possibility of time travel), to mention only a few. There is 
much hope that foundation will help to clarify and understand the statuses of these 
statements and questions as well as the concepts related to them. This is one of the 
many reasons why the relatively large group led by Andreka Hajnal and Istvan Nemeti 
have devoted so much effort and enthusiasm to providing foundation for spacetime 
theories. See [69, pp.144 footnote 137] for a physicist's reflection on some of that. 



11.3 Choosing a logic for foundation 

To provide logical foundation of any field of science, we have to choose a formal logic. 
In the following sections we show that our choosing FOL is the best possible choice in 
several senses. To do so, we compare it to other logics from different aspects. 

Since we would like to treat the physical world as a possible model of our theory in 
certain physical interpretation, we need a logic with semantical consequence relation. 
Even after this restriction, there are a great many different logics which we could use 
for axiomatic foundation. The two most popular candidates are FOL and (standard 
or full) second-order logic. The main difference between them is that in second-order 
logic it is possible to quantify over n-aij relations while in FOL we can quantify just 
over individuals. 

Because of its great expressive power, it would be convenient to use second-order 
logic. However, as it will be showed in the forthcoming paragraphs, its great expressive 
power is rather a disadvantage. 

A main problem with second-order logic is that it contains tacit assumptions about 
sets. That is so because unary relations and subsets are essentially the same things. 
Hence if we use second-order logic, we tacitly build set theory into our theory, and that 
generates several problems. On the other hand, FOL does not contain any assumptions 
about sets.^ 

Vaananen in [78] says: "First-order set theory and second-order logic are not rad- 
ically different: the latter is a major fragment of the former." In [50, §Set theory in 
Sheep's Clothing], Quine also argues that second-order logic is none other than a set 
theory in disguise. So if we do not want to be burdened (or loaded) by any hidden 
assumptions about sets, we cannot use second-order logic for foundation. The same 
argument applies to standard higher-order logic and type theory. 



^Of course, to prove nontrivial theorems about FOL, we need some basic set theory as a metatheory. 
However, that does not contradict the fact that FOL is free of any hidden assumptions about sets. 
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11.4 Completeness 

Completeness is also a fundamental property of the logic we choose for foundation since 
without it we cannot have control over the true statements in the models of our axioms. 
FOL is complete by Godel's completeness theorem, but second-order logic is not, see, 
e.g., [20, §IX.l.]. That means that the semantical consequence relation of second-order 
logic is vague, which by itself is enough to exclude second-order logic from the list of 
possible logics for foundation. 

Let us, however, dwell on the vagueness of the semantical consequence relation of 
second-order logic. Not just there is no sound and complete system of derivation rules 
for second-order logic, but the set of Godel numbers of second-order logic validities is 
not definable by any second-order logic formula (in the standard model of arithmetic), 
see [22, Thm.41C]. Hence it is not just not recursively enumerable, but it is not at any 
level of the arithmetical hierarchy of FOL definable sets of numbers. 

The complexity of second-order validities in a language containing one binary re- 
lation symbol is also very high since it cannot be defined by any higher-order logic 
formula in the language of Peano arithmetic, and a formula of complexity 112 is needed 
to define it in the language of set theory, see Vaananen [78] . These results show that 
the validity relation of second-order logic is too blurred and vague for our purposes. 

In contrast, the set of FOL validities is recursively enumerable, see [20, §X. Prop.L6], 
and the set of consequences of any recursive enumerable FOL theory is recursively enu- 
merable by [22, Thm.35I] and [44, Thm.15.1]. 

11.5 Absoluteness 

Naturally, we would like to choose logic L such that its semantical consequence relation 
(|=l) is as independent from set theory as possible. This property of a logic is called 
absoluteness. Absoluteness of a logic roughly means that the truth or falsity of 971 |=l </? 
does not depend on the entire set theoretical universe, only on the sets required to exist 
by some fixed list of axioms (e.g., ZF or a fragment of ZF) and on the transitive closures 
of the sets 9Jt and ip under discussion. For exact definition, see, e.g., [12], [77]. 

Let us now see some examples that show what can happen if we use a non-absolute 
logic, such as second-order logic. We can formulate the continuum hypothesis (CH) 
in second-order logic, see, e.g., [20], [45], [60]. Let (fcH be a second-order formula 
expressing CH and let |=2 be the semantical consequence relation of second-order logic. 

Example 11.5.1. The answer to the simple question whether R \=2 ^ch oi M ^2 ^ch 
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holds, depends on the model of set theory we are working in. So it is unknowable. 
Moreover, this dependence is so strong that the answer may alter by moving from the 
set theoretical universe V we work in to a transitive submodel of V. 

Let us now see a more general example. 

Example 11.5.2. Let (poo be a formula of second-order logic expressing that its model 
contains infinitely many elements; it is not difficult to write up such a formula, see [20, 
§IX. 1.3]. Let ip be the following formula of second-order logic: ipoo — )■ 'PcH- Then 
for any infinite structure 971, the question whether 3Jt |=2 ^ or S!Jt ^2 ^ holds is also 
unknowable. 

On the basis of the many independent statements of set theory, we can generate a 
great many unknowable sentences of second-order logic. 

Let us note here that a statement being unknowable and being independent from a 
theory does not mean the same. Unknowability of a statement means that its validity 
depends on what class model of the metatheory we are working in. So unknowability is 
highly undesirable, while independence is not problematic at all. Moreover, in founda- 
tions, it is useful to study incomplete theories, see Section 11.1. Hence independence 
can be useful. 

The examples above show that absoluteness is a desired property of a logic used 
for foundation. It is important to note that the above situations cannot occur in FOL 
because it is absolute in a strong sense, i.e., it is absolute in relation to the Kripke- 
Platek set theory^ (KP), which is considerably weaker than ZF, see, e.g., [77, Example 
2.1.3]. 

11.6 Categoricity 

First of all let us note that, in logical foundation, the fewer axioms a theory contains 
the better it is; so categoricity (and even completeness) of an axiom system is not a 
desired property. Moreover, searching for strong (e.g., categorical or complete) axiom 
systems is a fallacy in foundation of a physical theory since in physics we do not really 
know whether an axiom is true or not, we just presume so. 

Nevertheless, it is often considered as a great advantage of second-order logic that it 
is possible to axiomatize something categorically within it, i.e., it can capture structures 



■^KP consists only the axioms of extensionality, foundation, pair, union, and the separation and 
coUcction schemas restricted to formulas containing only bounded quantifiers, see, e.g., [11] 
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up to isomorphism. The following examples will show that categoricity is rather a 
disadvantage as it can obscure things we are interested in. 

Example 11.6.1. Let us consider the "nice" (finite and categorical) second-order 
axiomatization RCF2 of real numbers. Since RCF2 is categorical, there is only one 
model of it (M). Now we can think that we have captured what we wanted and nothing 
else. However, if we take a closer look, we will see that we can ask many unanswerable 
questions about M. For example, since CH is independent from set theory, we do not 
know whether there is or there is not an uncountable subset if of M such that there is 
no bijection between H and M. That is inconvenient because there is only one model of 
RCF2. So either M \=2 (fcH or M \=2 ~"^ch must be valid but we cannot know which 
one. At first glance it is not clear at all how a concrete yes-or-no question can exist 
without a definite answer? The problem results from the fact that we have captured 
one M in each set theory model. However, since there are several models of set theory, 
we have several M's, too. 

Let us now see another example which shows that we can lose important information 
about the model we intend to capture if we use second-order logic. 

Example 11.6.2. In second-order logic, thanks to its expressive power, we can for- 
mulate an axiom that states that if CH is true, there is an isomorphism between its 
model and N, and if CH is false, its model is isomorphic to the ordered ring of integers 

(zy. 



(ipcH ^ 971 = N) A i^ipcH -> M ^ Z). 



So the axiom system containing the above formula only is categorical, yet it is un- 
knowable whether there is a least element of the structure which is captured up to 
isomorphism. 

By the trick of the above example we can provide many categorical axiom systems 
where some very basic properties (e.g., finiteness/infiniteness) about the unique model 
are unknowable, see Andreka-Madarasz-Nemeti [2, §Why FOL?]. 

These examples show that categoricity is not at all as good a thing as it seems to 
be, and sometimes a non-categorical FOL axiomatization can provide more information 
about its several models than a categorical second-order logic axiomatization can about 
its unique model. Second-order logic only makes us believe that we have one particular 
object in hand, but in fact, we have many. 
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11.7 Henkin semantics of second-order logic 

There are also other semantics of second-order logic in addition to standard (or full) 
semantics where all the relations are present. Henkin further generalized standard se- 
mantics and introduced such ones in which just some of the relations (but at least all 
the definable ones) are present such that these relations satisfy certain requirements, see 
Henkin [30]. Vaananen in [78] argues that if second-order logic is used for foundation, 
we cannot meaningfully ask which semantics is being used. That is so because eventu- 
ally everything boils down to writing proofs in the computable inference system of the 
logic used. So the standard version of second-order logic cannot be used for foundation, 
only the generalized Henkin second-order logic is suitable for this purpose. The Henkin 
second-order logic is actually a theory of many-sorted FOL, so one can only pretend 
using standard second-order logic for foundation. Furthermore, when mathematicians 
are apparently using higher-order logic, they are actually using Henkin higher-order 
one. 

In the present approach Henkin higher-order logic is considered absolutely accept- 
able. It has a completeness theorem and is absolute. So we do not hesitate to use it 
when needed. Hence higher-order logic tools are acceptable and available for us if they 
are treated with appropriate caution. 

11.8 Our choice of logic in the light of Lindstrom's 
Theorem 

So far we have mainly argued for choosing a complete and absolute logic for foundation, 
such as FOL. Thus second-order logic and hence any higher-order logic is too strong for 
our aims. However, there are many model-theoretic logics which are stronger than FOL 
but weaker than second-order logic, e.g., weak second-order logic, infinitary logics or 
logics with generalized quantifiers, etc. Can any of these logics be good for our purpose? 
So our question can be restated as follows: Is there any complete and absolute model- 
theoretic logic stronger than FOL?^ To answer this question, let us recall two properties 
of abstract model-theoretic logics. A logic is compact iff every set of sentences S of the 
logic has a model if all finite subsets of S have models. A logic has the Lowenheim- 
Skolem property iff every sentences ip of the logic has a countable model if if has a 
model. There is a well-known theorem of Lindstrom that characterizes FOL as the 
strongest model-theoretic logic with these properties, see, e.g., [23]. 



*For the exact definition of comparing tlie strengtli of two logics, see [19]. 
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Theorem 11.8.1 (Lindstrom). FOL is the strongest compact model-theoretic logic 
with Lowenheim-Skolem property. 

However, Lindstrom's theorem does not answer our question by itself since we have 
required two different properties (absoluteness and completeness) of a logic to judge it 
suitable for foundation. A theorem of Vaananen's comes to our aid, see Cor. 2. 2. 3 in 

[77]. 

Theorem 11.8.2 (Vaananen). Every absolute model-theoretic logic has the Lowenheim- 
Skolem property. 

Since completeness implies compactness, by putting Lindstrom's and Vaananen's the- 
orems together, we get the following: 

Corollary 11.8.3. FOL is the strongest abstract model-theoretic logic which is com- 
plete and absolute. 

This corollary implies that FOL is the strongest abstract model-theoretic logic suit- 
able for foundation. 
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